Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec 5:2018:750-759.
eCollection 2018.

Using Convolutional Neural Networks to Support Insertion of New Concepts into SNOMED CT

Affiliations

Using Convolutional Neural Networks to Support Insertion of New Concepts into SNOMED CT

Hao Liu et al. AMIA Annu Symp Proc. .

Abstract

Many major medical ontologies go through a regular (bi-annual, monthly, etc.) release cycle. A new release will contain corrections to the previous release, as well as genuinely new concepts that are the result of either user requests or new developments in the domain. New concepts need to be placed at the correct place in the ontology hierarchy. Traditionally, this is done by an expert modeling a new concept and running a classifier algorithm. We propose an alternative approach that is based on providing only the name of a new concept and using a Convolutional Neural Network-based machine learning method. We first tested this approach within one version of SNOMED CT and achieved an average 88.5% precision and an F1 score of 0.793. In comparing the July 2017 release with the January 2018 release, limiting ourselves to predicting one out of two or more parents, our average F1 score was 0.701.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Serializing the hierarchical structure of one concept into one document
Figure 2:
Figure 2:
Data flow during vectorization phase. D = 336,893 is the total number of concepts.
Figure 3:
Figure 3:
Data flow during training the CNN model. Test data is also generated “on the fly.”
Figure 4:
Figure 4:
Data flow during testing CNN mode phase
Figure 5:
Figure 5:
Data flow during testing CNN model phase with new data (use case)

References

    1. Shearer R, Motik B, Horrocks I. OWLED; 2008. editors. HermiT: A Highly-Efficient OWL Reasoner.
    1. Phyu TN. 2009. editor Survey of classification techniques in data mining. Proceedings of the International MultiConference of Engineers and Computer Scientists.
    1. Lin Y, Shen S, Liu Z, Luan H, Sun M. editors. Neural relation extraction with selective attention over instances. Proceedings of the 54th Annual Meeting of the Assoc. for Computational Linguistics (V1: Long Papers).2016.
    1. Le QV, Mikolov T. 2014. Distributed Representations of Sentences and Documents. CoRR. abs/1405.4053.
    1. Spackman KA. American Medical Informatics Association; 2001. editor Normal forms for description logic expressions of clinical concepts in SNOMED RT. Proceedings of the AMIA Symposium. - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources