Integrating Clinical Data and Imputed Transcriptome from GWAS to Uncover Complex Disease Subtypes: Applications in Psychiatry and Cardiology
- PMID: 31785786
- PMCID: PMC6904812
- DOI: 10.1016/j.ajhg.2019.10.012
Integrating Clinical Data and Imputed Transcriptome from GWAS to Uncover Complex Disease Subtypes: Applications in Psychiatry and Cardiology
Abstract
Classifying subjects into clinically and biologically homogeneous subgroups will facilitate the understanding of disease pathophysiology and development of targeted prevention and intervention strategies. Traditionally, disease subtyping is based on clinical characteristics alone, but subtypes identified by such an approach may not conform exactly to the underlying biological mechanisms. Very few studies have integrated genomic profiles (e.g., those from GWASs) with clinical symptoms for disease subtyping. Here we proposed an analytic framework capable of finding complex diseases subgroups by leveraging both GWAS-predicted gene expression levels and clinical data by a multi-view bicluster analysis. This approach connects SNPs to genes via their effects on expression, so the analysis is more biologically relevant and interpretable than a pure SNP-based analysis. Transcriptome of different tissues can also be readily modeled. We also proposed various evaluation metrics for assessing clustering performance. Our framework was able to subtype schizophrenia subjects into diverse subgroups with different prognosis and treatment response. We also applied the framework to the Northern Finland Birth Cohort (NFBC) 1966 dataset and identified high and low cardiometabolic risk subgroups in a gender-stratified analysis. The prediction strength by cross-validation was generally greater than 80%, suggesting good stability of the clustering model. Our results suggest a more data-driven and biologically informed approach to defining metabolic syndrome and subtyping psychiatric disorders. Moreover, we found that the genes "blindly" selected by the algorithm are significantly enriched for known susceptibility genes discovered in GWASs of schizophrenia or cardiovascular diseases. The proposed framework opens up an approach to subject stratification.
Keywords: cardiovascular disease; clustering; disease subtyping; gene expression; genome-wide association study; schizophrenia.
Copyright © 2019 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
The authors declare no competing interests.
Figures






Similar articles
-
Leveraging genome-wide association and clinical data in revealing schizophrenia subgroups.J Psychiatr Res. 2018 Nov;106:106-117. doi: 10.1016/j.jpsychires.2018.09.010. Epub 2018 Sep 22. J Psychiatr Res. 2018. PMID: 30312963
-
Integrating brain imaging features and genomic profiles for the subtyping of major depression.Psychol Med. 2025 May 22;55:e158. doi: 10.1017/S0033291725001096. Psychol Med. 2025. PMID: 40400388 Free PMC article.
-
The overlap of genetic susceptibility to schizophrenia and cardiometabolic disease can be used to identify metabolically different groups of individuals.Sci Rep. 2021 Jan 12;11(1):632. doi: 10.1038/s41598-020-79964-x. Sci Rep. 2021. PMID: 33436761 Free PMC article.
-
Genome-wide association studies of schizophrenia: does bigger lead to better results?Curr Opin Psychiatry. 2012 Mar;25(2):76-82. doi: 10.1097/YCO.0b013e32835035dd. Curr Opin Psychiatry. 2012. PMID: 22277805 Free PMC article. Review.
-
Schizophrenia in the genetic era: a review from development history, clinical features and genomic research approaches to insights of susceptibility genes.Metab Brain Dis. 2024 Jan;39(1):147-171. doi: 10.1007/s11011-023-01271-x. Epub 2023 Aug 5. Metab Brain Dis. 2024. PMID: 37542622 Review.
Cited by
-
Turning genome-wide association study findings into opportunities for drug repositioning.Comput Struct Biotechnol J. 2020 Jun 12;18:1639-1650. doi: 10.1016/j.csbj.2020.06.015. eCollection 2020. Comput Struct Biotechnol J. 2020. PMID: 32670504 Free PMC article. Review.
-
Integrative omics of schizophrenia: from genetic determinants to clinical classification and risk prediction.Mol Psychiatry. 2022 Jan;27(1):113-126. doi: 10.1038/s41380-021-01201-2. Epub 2021 Jun 30. Mol Psychiatry. 2022. PMID: 34193973 Free PMC article. Review.
-
Genetic imputation of kidney transcriptome, proteome and multi-omics illuminates new blood pressure and hypertension targets.Nat Commun. 2024 Mar 19;15(1):2359. doi: 10.1038/s41467-024-46132-y. Nat Commun. 2024. PMID: 38504097 Free PMC article.
-
Shaping the Trans-Scale Properties of Schizophrenia via Cerebral Alterations on Magnetic Resonance Imaging and Single-Nucleotide Polymorphisms of Coding and Non-Coding Regions.Front Hum Neurosci. 2021 Sep 9;15:720239. doi: 10.3389/fnhum.2021.720239. eCollection 2021. Front Hum Neurosci. 2021. PMID: 34566604 Free PMC article. Review.
-
Artificial intelligence and machine learning-aided drug discovery in central nervous system diseases: State-of-the-arts and future directions.Med Res Rev. 2021 May;41(3):1427-1473. doi: 10.1002/med.21764. Epub 2020 Dec 9. Med Res Rev. 2021. PMID: 33295676 Free PMC article. Review.
References
-
- So H.C., Chau C.K., Chiu W.T., Ho K.S., Lo C.P., Yim S.H., Sham P.C. Analysis of genome-wide association data highlights candidates for drug repositioning in psychiatry. Nat. Neurosci. 2017;20:1342–1349. - PubMed
-
- Arnedo J., Svrakic D.M., Del Val C., Romero-Zaliz R., Hernández-Cuervo H., Fanous A.H., Pato M.T., Pato C.N., de Erausquin G.A., Cloninger C.R., Zwir I., Molecular Genetics of Schizophrenia Consortium Uncovering the hidden risk architecture of the schizophrenias: confirmation in three independent genome-wide association studies. Am. J. Psychiatry. 2015;172:139–153. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical