The genetic interpretation of area under the ROC curve in genomic profiling
- PMID: 20195508
- PMCID: PMC2829056
- DOI: 10.1371/journal.pgen.1000864
The genetic interpretation of area under the ROC curve in genomic profiling
Abstract
Genome-wide association studies in human populations have facilitated the creation of genomic profiles which combine the effects of many associated genetic variants to predict risk of disease. The area under the receiver operator characteristic (ROC) curve is a well established measure for determining the efficacy of tests in correctly classifying diseased and non-diseased individuals. We use quantitative genetics theory to provide insight into the genetic interpretation of the area under the ROC curve (AUC) when the test classifier is a predictor of genetic risk. Even when the proportion of genetic variance explained by the test is 100%, there is a maximum value for AUC that depends on the genetic epidemiology of the disease, i.e. either the sibling recurrence risk or heritability and disease prevalence. We derive an equation relating maximum AUC to heritability and disease prevalence. The expression can be reversed to calculate the proportion of genetic variance explained given AUC, disease prevalence, and heritability. We use published estimates of disease prevalence and sibling recurrence risk for 17 complex genetic diseases to calculate the proportion of genetic variance that a test must explain to achieve AUC = 0.75; this varied from 0.10 to 0.74. We provide a genetic interpretation of AUC for use with predictors of genetic risk based on genomic profiles. We provide a strategy to estimate proportion of genetic variance explained on the liability scale from estimates of AUC, disease prevalence, and heritability (or sibling recurrence risk) available as an online calculator.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures



Similar articles
-
Sibling method increases risk assessment estimates for type 1 diabetes.PLoS One. 2017 May 16;12(5):e0176341. doi: 10.1371/journal.pone.0176341. eCollection 2017. PLoS One. 2017. PMID: 28510587 Free PMC article.
-
A unifying framework for evaluating the predictive power of genetic variants based on the level of heritability explained.PLoS Genet. 2010 Dec 2;6(12):e1001230. doi: 10.1371/journal.pgen.1001230. PLoS Genet. 2010. PMID: 21151957 Free PMC article.
-
A new concordant partial AUC and partial c statistic for imbalanced data in the evaluation of machine learning algorithms.BMC Med Inform Decis Mak. 2020 Jan 6;20(1):4. doi: 10.1186/s12911-019-1014-6. BMC Med Inform Decis Mak. 2020. PMID: 31906931 Free PMC article.
-
AUC-RF: a new strategy for genomic profiling with random forest.Hum Hered. 2011;72(2):121-32. doi: 10.1159/000330778. Epub 2011 Oct 11. Hum Hered. 2011. PMID: 21996641
-
A program for computing the prediction probability and the related receiver operating characteristic graph.Anesth Analg. 2010 Dec;111(6):1416-21. doi: 10.1213/ANE.0b013e3181fb919e. Epub 2010 Nov 8. Anesth Analg. 2010. PMID: 21059744 Review.
Cited by
-
Genetics and the individualized prediction of fracture.Curr Osteoporos Rep. 2012 Sep;10(3):236-44. doi: 10.1007/s11914-012-0113-4. Curr Osteoporos Rep. 2012. PMID: 22851044 Review.
-
Analytical and simulation methods for estimating the potential predictive ability of genetic profiling: a comparison of methods and results.Eur J Hum Genet. 2012 Dec;20(12):1270-4. doi: 10.1038/ejhg.2012.89. Epub 2012 May 30. Eur J Hum Genet. 2012. PMID: 22643180 Free PMC article.
-
Hints of hidden heritability in GWAS.Nat Genet. 2010 Jul;42(7):558-60. doi: 10.1038/ng0710-558. Nat Genet. 2010. PMID: 20581876
-
Being more realistic about the public health impact of genomic medicine.PLoS Med. 2010 Oct 12;7(10):e1000347. doi: 10.1371/journal.pmed.1000347. PLoS Med. 2010. PMID: 20967240 Free PMC article. Review.
-
Complementary Experimental Methods in Genetics Open Up New Avenues of Research to Elucidate the Pathogenesis of Periodontitis.Adv Exp Med Biol. 2022;1373:209-227. doi: 10.1007/978-3-030-96881-6_11. Adv Exp Med Biol. 2022. PMID: 35612800
References
-
- McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, et al. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nature Reviews Genetics. 2008;9:356–369. - PubMed
-
- Iles MM. What can genome-wide association studies tell us about the genetics of common disease? PLoS Genet. 2008;4:e33. doi: 10.1371/journal.pgen.0040033. - DOI - PMC - PubMed
-
- Janssens AC, Aulchenko YS, Elefante S, Borsboom GJ, Steyerberg EW, et al. Predictive testing for complex diseases using multiple genes: fact or fiction? Genet Med. 2006;8:395–400. - PubMed
-
- Kraft P, Wacholder S, Cornelis MC, Hu FB, Hayes RB, et al. OPINION Beyond odds - ratios communicating disease risk based on genetic profiles. Nature Reviews Genetics. 2009;10:264–269. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources