Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Mar 27:10:267.
doi: 10.3389/fgene.2019.00267. eCollection 2019.

Machine Learning SNP Based Prediction for Precision Medicine

Affiliations
Review

Machine Learning SNP Based Prediction for Precision Medicine

Daniel Sik Wai Ho et al. Front Genet. .

Abstract

In the past decade, precision genomics based medicine has emerged to provide tailored and effective healthcare for patients depending upon their genetic features. Genome Wide Association Studies have also identified population based risk genetic variants for common and complex diseases. In order to meet the full promise of precision medicine, research is attempting to leverage our increasing genomic understanding and further develop personalized medical healthcare through ever more accurate disease risk prediction models. Polygenic risk scoring and machine learning are two primary approaches for disease risk prediction. Despite recent improvements, the results of polygenic risk scoring remain limited due to the approaches that are currently used. By contrast, machine learning algorithms have increased predictive abilities for complex disease risk. This increase in predictive abilities results from the ability of machine learning algorithms to handle multi-dimensional data. Here, we provide an overview of polygenic risk scoring and machine learning in complex disease risk prediction. We highlight recent machine learning application developments and describe how machine learning approaches can lead to improved complex disease prediction, which will help to incorporate genetic features into future personalized healthcare. Finally, we discuss how the future application of machine learning prediction models might help manage complex disease by providing tissue-specific targets for customized, preventive interventions.

Keywords: complex disease risk; genetic disease risk prediction; machine learning; personalized medicine; polygenic risk score; precision medicine.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Workflow for creating a supervised machine learning model from a genotype dataset.
Figure 2
Figure 2
The strengths and weaknesses of polygenic risk scoring and machine learning model.

References

    1. Abraham G., Inouye M. (2015). Genomic risk prediction of complex human disease and its clinical application. Curr. Opin. Genet. Dev. 33 10–16. 10.1016/j.gde.2015.06.005 - DOI - PubMed
    1. Abraham G., Kowalczyk A., Zobel J., Inouye M. (2013). Performance and robustness of penalized and unpenalized methods for genetic prediction of complex human disease. Genet. Epidemiol. 37 184–195. 10.1002/gepi.21698 - DOI - PubMed
    1. Abraham G., Tye-Din J. A., Bhalala O. G., Kowalczyk A., Zobel J., Inouye M. (2014). Accurate and robust genomic prediction of celiac disease using statistical learning. PLoS Genet. 10:e1004137. 10.1371/journal.pgen.1004137 - DOI - PMC - PubMed
    1. Alipanahi B., Delong A., Weirauch M. T., Frey B. J. (2015). Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat. Biotechnol. 33 831–838. 10.1038/nbt.3300 - DOI - PubMed
    1. Amin N., Van Duijn C. M., Janssens A. C. J. W. (2009). Genetic scoring analysis: a way forward in genome wide association studies? Eur. J. Epidemiol. 24 585–587. 10.1007/s10654-009-9387-y - DOI - PMC - PubMed