Biomedical informatics and machine learning for clinical genomics
- PMID: 29566172
- PMCID: PMC5946905
- DOI: 10.1093/hmg/ddy088
Abstract
While tens of thousands of pathogenic variants are used to inform the many clinical applications of genomics, there remains limited information on quantitative disease risk for the majority of variants used in clinical practice. At the same time, rising demand for genetic counselling has prompted a growing need for computational approaches that can help interpret genetic variation. Such tasks include predicting variant pathogenicity and identifying variants that are too common to be penetrant. To address these challenges, researchers are increasingly turning to integrative informatics approaches. These approaches often leverage vast sources of data, including electronic health records and population-level allele frequency databases (e.g. gnomAD), as well as machine learning techniques such as support vector machines and deep learning. In this review, we highlight recent informatics and machine learning approaches that are improving our understanding of pathogenic variation and discuss obstacles that may limit their emerging role in clinical genomics.
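One of the tasks named above, identifying variants that are too common to be penetrant, can be illustrated with a back-of-the-envelope bound. The sketch below is not taken from the review; it assumes a dominant disorder, a rare allele in Hardy-Weinberg proportions (carrier frequency of roughly twice the allele frequency), and that the variant on its own cannot account for more cases than the disease prevalence. The function names and the example numbers are hypothetical, and the bound ignores sampling error in the observed frequency as well as genetic and allelic heterogeneity.

```python
def max_credible_allele_frequency(prevalence: float, penetrance: float) -> float:
    """Upper bound on the allele frequency of a dominant, penetrant variant.

    Assumption (illustrative, not from the review): carriers occur at about
    2 * q for a rare allele with frequency q, so the cases caused by the
    variant, 2 * q * penetrance, cannot exceed the disease prevalence.
    """
    if not 0 < prevalence <= 1:
        raise ValueError("prevalence must be in (0, 1]")
    if not 0 < penetrance <= 1:
        raise ValueError("penetrance must be in (0, 1]")
    return prevalence / (2 * penetrance)


def too_common_to_be_penetrant(observed_af: float,
                               prevalence: float,
                               penetrance: float) -> bool:
    """Flag a variant whose population frequency exceeds the bound."""
    return observed_af > max_credible_allele_frequency(prevalence, penetrance)


if __name__ == "__main__":
    # Hypothetical inputs: disease prevalence of 1 in 500, penetrance of 0.5.
    bound = max_credible_allele_frequency(prevalence=1 / 500, penetrance=0.5)
    print(f"maximum credible allele frequency: {bound:.4f}")
    print(too_common_to_be_penetrant(0.01, 1 / 500, 0.5))
    print(too_common_to_be_penetrant(0.0001, 1 / 500, 0.5))
```

In practice the observed frequency would come from a population database such as gnomAD, and a more careful analysis would use a confidence bound on that frequency rather than the point estimate.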