GW-SEM: A Statistical Package to Conduct Genome-Wide Structural Equation Modeling
- PMID: 28299468
- PMCID: PMC5423544
- DOI: 10.1007/s10519-017-9842-6
GW-SEM: A Statistical Package to Conduct Genome-Wide Structural Equation Modeling
Abstract
Improving the accuracy of phenotyping through the use of advanced psychometric tools will increase the power to find significant associations with genetic variants and expand the range of possible hypotheses that can be tested on a genome-wide scale. Multivariate methods, such as structural equation modeling (SEM), are valuable in the phenotypic analysis of psychiatric and substance use phenotypes, but these methods have not been integrated into standard genome-wide association analyses because fitting a SEM at each single nucleotide polymorphism (SNP) along the genome was hitherto considered to be too computationally demanding. By developing a method that can efficiently fit SEMs, it is possible to expand the set of models that can be tested. This is particularly necessary in psychiatric and behavioral genetics, where the statistical methods are often handicapped by phenotypes with large components of stochastic variance. Due to the enormous amount of data that genome-wide scans produce, the statistical methods used to analyze the data are relatively elementary and do not directly correspond with the rich theoretical development, and lack the potential to test more complex hypotheses about the measurement of, and interaction between, comorbid traits. In this paper, we present a method to test the association of a SNP with multiple phenotypes or a latent construct on a genome-wide basis using a diagonally weighted least squares (DWLS) estimator for four common SEMs: a one-factor model, a one-factor residuals model, a two-factor model, and a latent growth model. We demonstrate that the DWLS parameters and p-values strongly correspond with the more traditional full information maximum likelihood parameters and p-values. We also present the timing of simulations and power analyses and a comparison with and existing multivariate GWAS software package.
Keywords: DWLS; Diagonally weighted least squares; GWAS; Genetics; Genome-wide association study; SEM; Structural equation modeling.
Conflict of interest statement
Figures



Similar articles
-
GW-SEM 2.0: Efficient, Flexible, and Accessible Multivariate GWAS.Behav Genet. 2021 May;51(3):343-357. doi: 10.1007/s10519-021-10043-1. Epub 2021 Feb 19. Behav Genet. 2021. PMID: 33604756
-
An efficient genome-wide association test for multivariate phenotypes based on the Fisher combination function.BMC Bioinformatics. 2016 Jan 5;17:19. doi: 10.1186/s12859-015-0868-6. BMC Bioinformatics. 2016. PMID: 26729364 Free PMC article.
-
Genome-wide Analysis of Large-scale Longitudinal Outcomes using Penalization -GALLOP algorithm.Sci Rep. 2018 May 1;8(1):6815. doi: 10.1038/s41598-018-24578-7. Sci Rep. 2018. PMID: 29717146 Free PMC article.
-
Software engineering the mixed model for genome-wide association studies on large samples.Brief Bioinform. 2009 Nov;10(6):664-75. doi: 10.1093/bib/bbp050. Brief Bioinform. 2009. PMID: 19933212 Review.
-
Single Marker Family-Based Association Analysis Not Conditional on Parental Information.Methods Mol Biol. 2017;1666:409-439. doi: 10.1007/978-1-4939-7274-6_20. Methods Mol Biol. 2017. PMID: 28980257 Review.
Cited by
-
Multi-trait multi-locus SEM model discriminates SNPs of different effects.BMC Genomics. 2020 Jul 28;21(Suppl 8):490. doi: 10.1186/s12864-020-06833-2. BMC Genomics. 2020. PMID: 32723302 Free PMC article.
-
Patterns of item nonresponse behaviour to survey questionnaires are systematic and associated with genetic loci.Nat Hum Behav. 2023 Aug;7(8):1371-1387. doi: 10.1038/s41562-023-01632-7. Epub 2023 Jun 29. Nat Hum Behav. 2023. PMID: 37386106 Free PMC article.
-
A web-based survey on various symptoms of computer vision syndrome and the genetic understanding based on a multi-trait genome-wide association study.Sci Rep. 2021 May 3;11(1):9446. doi: 10.1038/s41598-021-88827-y. Sci Rep. 2021. PMID: 33941792 Free PMC article.
-
Using Genetic Marginal Effects to Study Gene-Environment Interactions with GWAS Data.Behav Genet. 2021 May;51(3):358-373. doi: 10.1007/s10519-021-10058-8. Epub 2021 Apr 26. Behav Genet. 2021. PMID: 33899139
-
Leveraging pleiotropy for the improved treatment of psychiatric disorders.Mol Psychiatry. 2025 Feb;30(2):705-721. doi: 10.1038/s41380-024-02771-7. Epub 2024 Oct 10. Mol Psychiatry. 2025. PMID: 39390223 Free PMC article. Review.
References
-
- Abecasis GR, Cherny SS, Cookson WO, Cardon LR. Merlinrapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet. 2002 Jan;30(1):97–101. - PubMed
-
- Agresti A. Categorical data analysis. second. Wiley-Interscience; 2002.
-
- Bock RD, Aitkin M. Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika. 1981;46(4):443459.
-
- Boker SM, Neale MC, Maes HH, Wilde MJ, Spiegel M, Brick TR, Driver C. Openmx 2.3.1 user guide [Computer software manual] 2015.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases