Predictive models for breast cancer susceptibility from multiple single nucleotide polymorphisms
- PMID: 15102677
- DOI: 10.1158/1078-0432.ccr-1115-03
Abstract
Hereditary predisposition and causative environmental exposures have long been recognized in human malignancies. In most instances, cancer cases occur sporadically, suggesting that environmental influences are critical in determining cancer risk. To test the influence of genetic polymorphisms on breast cancer risk, we have measured 98 single nucleotide polymorphisms (SNPs) distributed over 45 genes of potential relevance to breast cancer etiology in 174 patients and have compared these with matched normal controls. Using machine learning techniques such as support vector machines (SVMs), decision trees, and naïve Bayes, we identified a subset of three SNPs as key discriminators between breast cancer and controls. The SVMs performed maximally among predictive models, achieving 69% predictive power in distinguishing between the two groups, compared with a 50% baseline predictive power obtained from the data after repeated random permutation of class labels (individuals with cancer or controls). However, the simpler naïve Bayes model as well as the decision tree model performed quite similarly to the SVM. The three SNP sites most useful in this model were (a) the +4536T/C site of the aldosterone synthase gene CYP11B2 at amino acid residue 386 Val/Ala (T/C) (rs4541); (b) the +4328C/G site of the aryl hydrocarbon hydroxylase CYP1B1 at amino acid residue 293 Leu/Val (C/G) (rs5292); and (c) the +4449C/T site of the transcription factor BCL6 at amino acid 387 Asp/Asp (rs1056932). No single SNP site on its own could achieve more than 60% in predictive accuracy. We have shown that multiple SNP sites from different genes over distant parts of the genome are better at identifying breast cancer patients than any one SNP alone. As high-throughput technology for SNPs improves and as more SNPs are identified, it is likely that much higher predictive accuracy will be achieved and a useful clinical tool developed.
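The comparison the abstract describes, a classifier's accuracy against a baseline obtained by repeatedly permuting the class labels, can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' pipeline. The genotypes are synthetic (coded 0/1/2), the effect sizes are invented, and the helper names (`make_synthetic`, `fit_nb`, `cv_accuracy`, `permutation_baseline`) are hypothetical. A categorical naïve Bayes model stands in for the SVM because it needs only the standard library, and the abstract reports that it performed similarly. The use of 5-fold cross-validation and 20 permutations is likewise an assumption.

```python
import math
import random
from collections import Counter


def make_synthetic(n_per_class, n_snps, informative, rng):
    """Synthetic genotypes (0/1/2 copies of an allele); NOT the study data."""
    X, y = [], []
    for label in (0, 1):  # 0 = control, 1 = case
        for _ in range(n_per_class):
            row = []
            for j in range(n_snps):
                if j in informative:
                    # Invented genotype frequencies that differ by class.
                    w = [0.6, 0.3, 0.1] if label == 0 else [0.1, 0.3, 0.6]
                else:
                    w = [0.5, 0.4, 0.1]  # same in both classes: pure noise
                row.append(rng.choices([0, 1, 2], weights=w)[0])
            X.append(row)
            y.append(label)
    return X, y


def fit_nb(X, y):
    """Categorical naive Bayes with add-one (Laplace) smoothing."""
    classes = sorted(set(y))
    prior = {c: math.log(y.count(c) / len(y)) for c in classes}
    cond = {}
    for c in classes:
        rows = [x for x, t in zip(X, y) if t == c]
        for j in range(len(X[0])):
            counts = Counter(r[j] for r in rows)
            for g in (0, 1, 2):
                cond[c, j, g] = math.log((counts[g] + 1) / (len(rows) + 3))
    return classes, prior, cond


def predict_nb(model, x):
    """Return the class with the highest log posterior for one genotype row."""
    classes, prior, cond = model
    return max(
        classes,
        key=lambda c: prior[c] + sum(cond[c, j, g] for j, g in enumerate(x)),
    )


def cv_accuracy(X, y, k, rng):
    """k-fold cross-validated accuracy; folds are drawn from a shuffled index."""
    idx = list(range(len(y)))
    rng.shuffle(idx)
    correct = 0
    for f in range(k):
        test = set(idx[f::k])
        train = [i for i in idx if i not in test]
        model = fit_nb([X[i] for i in train], [y[i] for i in train])
        correct += sum(predict_nb(model, X[i]) == y[i] for i in test)
    return correct / len(y)


def permutation_baseline(X, y, k, n_perm, rng):
    """Mean cross-validated accuracy after shuffling the class labels."""
    scores = []
    for _ in range(n_perm):
        shuffled = y[:]
        rng.shuffle(shuffled)  # breaks any genotype-label association
        scores.append(cv_accuracy(X, shuffled, k, rng))
    return sum(scores) / n_perm


rng = random.Random(0)
X, y = make_synthetic(n_per_class=100, n_snps=10, informative={0, 1, 2}, rng=rng)
acc = cv_accuracy(X, y, k=5, rng=rng)
baseline = permutation_baseline(X, y, k=5, n_perm=20, rng=rng)
print(f"cross-validated accuracy: {acc:.2f}")
print(f"permuted-label baseline:  {baseline:.2f}")
```

With balanced classes the permuted-label baseline should sit near 50%, which is the reference point against which the 69% figure in the abstract is reported. The accuracy printed here reflects only the invented effect sizes and says nothing about the study's SNPs.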