Predictive modeling of schizophrenia from genomic data: Comparison of polygenic risk score with kernel support vector machines approach
- PMID: 30516002
- PMCID: PMC6492016
- DOI: 10.1002/ajmg.b.32705
Predictive modeling of schizophrenia from genomic data: Comparison of polygenic risk score with kernel support vector machines approach
Abstract
A major controversy in psychiatric genetics is whether nonadditive genetic interaction effects contribute to the risk of highly polygenic disorders. We applied a support vector machines (SVMs) approach, which is capable of building linear and nonlinear models using kernel methods, to classify cases from controls in a large schizophrenia case-control sample of 11,853 subjects (5,554 cases and 6,299 controls) and compared its prediction accuracy with the polygenic risk score (PRS) approach. We also investigated whether SVMs are a suitable approach to detecting nonlinear genetic effects, that is, interactions. We found that PRS provided more accurate case/control classification than either linear or nonlinear SVMs, and give a tentative explanation why PRS outperforms both multivariate regression and linear kernel SVMs. In addition, we observe that nonlinear kernel SVMs showed higher classification accuracy than linear SVMs when a large number of SNPs are entered into the model. We conclude that SVMs are a potential tool for assessing the presence of interactions, prior to searching for them explicitly.
Keywords: polygenic risk score; schizophrenia; support vector machines.
© 2018 The Authors. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics published by Wiley Periodicals, Inc.
Conflict of interest statement
The authors have no conflict of interest to declare.
Figures
References
-
- Austin, P. C. , & Steyerberg, E. W. (2015). The number of subjects per variable required in linear regression analyses. Journal of Clinical Epidemiology, 68(6), 627–636. - PubMed
-
- Chen, S. H. , Sun, J. , Dimitrov, L. , Turner, A. R. , Adams, T. S. , Meyers, D. A. , … Hsu, F. C. (2008). A support vector machine approach for detecting gene–gene interaction. Genetic Epidemiology, 32(2), 152–167. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
