Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jan 1:2019:baz010.
doi: 10.1093/database/baz010.

Using deep learning to identify translational research in genomic medicine beyond bench to bedside

Affiliations

Using deep learning to identify translational research in genomic medicine beyond bench to bedside

Yi-Yu Hsu et al. Database (Oxford). .

Abstract

Tracking scientific research publications on the evaluation, utility and implementation of genomic applications is critical for the translation of basic research to impact clinical and population health. In this work, we utilize state-of-the-art machine learning approaches to identify translational research in genomics beyond bench to bedside from the biomedical literature. We apply the convolutional neural networks (CNNs) and support vector machines (SVMs) to the bench/bedside article classification on the weekly manual annotation data of the Public Health Genomics Knowledge Base database. Both classifiers employ salient features to determine the probability of curation-eligible publications, which can effectively reduce the workload of manual triage and curation process. We applied the CNNs and SVMs to an independent test set (n = 400), and the models achieved the F-measure of 0.80 and 0.74, respectively. We further tested the CNNs, which perform better results, on the routine annotation pipeline for 2 weeks and significantly reduced the effort and retrieved more appropriate research articles. Our approaches provide direct insight into the automated curation of genomic translational research beyond bench to bedside. The machine learning classifiers are found to be helpful for annotators to enhance the efficiency of manual curation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The CNNs for the TPC task.
Figure 2
Figure 2
Comparing CNN with the baseline date-sort method using ROC curves.

Similar articles

Cited by

References

    1. Khoury M.J., Gwinn M., Yoon P.W. et al. (2007) The continuum of translation research in genomic medicine: how can we accelerate the appropriate integration of human genome discoveries into health care and disease prevention. Genet. Med., 9, 665–674. - PubMed
    1. Clyne M., Schully S.D., Dotson W.D. et al. (2014) Horizon scanning for translational genomic research beyond bench to bedside. Genet. Med., 16, 535–538. - PMC - PubMed
    1. CDC Office of Public Health Genomics Genomic Tests and Family History by Levels of Evidence. https://phgkb.cdc.gov/PHGKB/topicStartPage.action Access date: July, 2018.
    1. Yu W., Gwinn M., Dotson W.D. et al. (2016) A knowledge base for tracking the impact of genomics on population health. Genet. Med., 18, 1312–1314. - PMC - PubMed
    1. Poux S., Arighi C.N., Magrane M. et al. (2017) On expert curation and scalability: UniProtKB/Swiss-Prot as a case study. Bioinformatics, 33, 3454–3460. - PMC - PubMed

Publication types