Coupled mixed model for joint genetic analysis of complex disorders with two independently collected data sets
- PMID: 33546598
- PMCID: PMC7866684
- DOI: 10.1186/s12859-021-03959-2
Coupled mixed model for joint genetic analysis of complex disorders with two independently collected data sets
Abstract
Background: In the last decade, Genome-wide Association studies (GWASs) have contributed to decoding the human genome by uncovering many genetic variations associated with various diseases. Many follow-up investigations involve joint analysis of multiple independently generated GWAS data sets. While most of the computational approaches developed for joint analysis are based on summary statistics, the joint analysis based on individual-level data with consideration of confounding factors remains to be a challenge.
Results: In this study, we propose a method, called Coupled Mixed Model (CMM), that enables a joint GWAS analysis on two independently collected sets of GWAS data with different phenotypes. The CMM method does not require the data sets to have the same phenotypes as it aims to infer the unknown phenotypes using a set of multivariate sparse mixed models. Moreover, CMM addresses the confounding variables due to population stratification, family structures, and cryptic relatedness, as well as those arising during data collection such as batch effects that frequently appear in joint genetic studies. We evaluate the performance of CMM using simulation experiments. In real data analysis, we illustrate the utility of CMM by an application to evaluating common genetic associations for Alzheimer's disease and substance use disorder using datasets independently collected for the two complex human disorders. Comparison of the results with those from previous experiments and analyses supports the utility of our method and provides new insights into the diseases. The software is available at https://github.com/HaohanWang/CMM .
Keywords: Deconfounding; Joint analysis; Mixed model.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures



Similar articles
-
Control for population stratification in genetic association studies based on GWAS summary statistics.Genet Epidemiol. 2022 Dec;46(8):604-614. doi: 10.1002/gepi.22493. Epub 2022 Jun 29. Genet Epidemiol. 2022. PMID: 35766057
-
Joint analysis of individual-level and summary-level GWAS data by leveraging pleiotropy.Bioinformatics. 2019 May 15;35(10):1729-1736. doi: 10.1093/bioinformatics/bty870. Bioinformatics. 2019. PMID: 30307540
-
Deep mixed model for marginal epistasis detection and population stratification correction in genome-wide association studies.BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):656. doi: 10.1186/s12859-019-3300-9. BMC Bioinformatics. 2019. PMID: 31881907 Free PMC article.
-
Software engineering the mixed model for genome-wide association studies on large samples.Brief Bioinform. 2009 Nov;10(6):664-75. doi: 10.1093/bib/bbp050. Brief Bioinform. 2009. PMID: 19933212 Review.
-
Recent innovations and in-depth aspects of post-genome wide association study (Post-GWAS) to understand the genetic basis of complex phenotypes.Heredity (Edinb). 2021 Dec;127(6):485-497. doi: 10.1038/s41437-021-00479-w. Epub 2021 Oct 23. Heredity (Edinb). 2021. PMID: 34689168 Free PMC article. Review.
Cited by
-
Comparison of two multi-trait association testing methods and sequence-based fine mapping of six additive QTL in Swiss Large White pigs.BMC Genomics. 2023 Apr 10;24(1):192. doi: 10.1186/s12864-023-09295-4. BMC Genomics. 2023. PMID: 37038103 Free PMC article.
References
-
- Mukherjee S, Thornton T, Naj A, Kim S, Kauwe J, Fardo D, Valladares O, Wijsman E, Schellenberg G, Crane P. GWAS of the joint ADGC data set identifies novel common variants associated with late-onset Alzheimer’s disease. Alzheimer’s Dement J Alzheimer’s Assoc. 2013;9(4):550. doi: 10.1016/j.jalz.2013.05.1071. - DOI
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources