DeepWAS: Multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning
- PMID: 32012148
- PMCID: PMC7043350
- DOI: 10.1371/journal.pcbi.1007616
Abstract
Genome-wide association studies (GWAS) identify genetic variants associated with traits or diseases. However, GWAS never directly link variants to regulatory mechanisms; instead, the functional annotation of variants is typically inferred by post hoc analyses. A specific class of deep learning-based methods allows for the prediction of regulatory effects per variant on several cell type-specific chromatin features. Here we describe "DeepWAS", a new approach that integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Single variants associated with a trait or disease are thereby directly coupled to their impact on a chromatin feature in a cell type. Up to 61 regulatory SNPs, called dSNPs, were associated with multiple sclerosis (MS, 4,888 cases and 10,395 controls), major depressive disorder (MDD, 1,475 cases and 2,144 controls), and height (5,974 individuals). These variants were mainly non-coding and reached at least nominal significance in classical GWAS. The prediction accuracy was higher for DeepWAS than for classical GWAS models for 91% of the genome-wide significant, MS-specific dSNPs. The dSNPs were enriched in public or cohort-matched expression and methylation quantitative trait loci, and we demonstrated the potential of DeepWAS to generate testable functional hypotheses based on genotype data alone. DeepWAS is available at https://github.com/cellmapslab/DeepWAS.
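The core idea of the abstract, coupling per-variant regulatory effect predictions with a multivariate association model, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the regression model, so a ridge-penalized linear regression is used here as a stand-in. The function names, the score threshold of 0.5, the toy genotypes, and the unit labels ("CTCF", "DNase", "cell_type_A", "cell_type_B") are all hypothetical.

```python
import numpy as np


def select_functional_units(effect_scores, threshold=0.5):
    """Group SNP indices by (chromatin feature, cell type).

    effect_scores maps each unit to one hypothetical predicted
    regulatory effect score per SNP; SNPs below the (assumed)
    threshold are dropped, and units with no SNP left are skipped.
    """
    units = {}
    for unit, scores in effect_scores.items():
        idx = np.flatnonzero(np.asarray(scores) >= threshold)
        if idx.size > 0:
            units[unit] = idx
    return units


def fit_ridge(genotypes, phenotype, penalty=1.0):
    """Jointly regress the phenotype on all SNPs of one unit.

    Ridge regression is an assumption of this sketch, chosen only
    because it has a closed-form solution.
    """
    x = genotypes - genotypes.mean(axis=0)
    y = phenotype - phenotype.mean()
    gram = x.T @ x + penalty * np.eye(x.shape[1])
    return np.linalg.solve(gram, x.T @ y)


def deepwas_like_scan(genotypes, phenotype, effect_scores,
                      threshold=0.5, penalty=1.0):
    """Fit one multivariate model per functional unit.

    Returns {unit: {snp_index: coefficient}}, so every coefficient
    is tied to a chromatin feature in a cell type.
    """
    results = {}
    units = select_functional_units(effect_scores, threshold)
    for unit, idx in units.items():
        coefs = fit_ridge(genotypes[:, idx], phenotype, penalty)
        results[unit] = dict(zip(idx.tolist(), coefs.tolist()))
    return results


# Toy data: 500 individuals, 6 SNPs coded as 0/1/2 allele counts.
rng = np.random.default_rng(0)
n_samples, n_snps = 500, 6
genotypes = rng.integers(0, 3, size=(n_samples, n_snps)).astype(float)
# Only SNP 2 truly influences the simulated quantitative trait.
phenotype = 0.8 * genotypes[:, 2] + rng.normal(0.0, 1.0, n_samples)

# Hypothetical predicted regulatory effects per SNP and unit.
effect_scores = {
    ("CTCF", "cell_type_A"): np.array([0.1, 0.7, 0.9, 0.2, 0.0, 0.3]),
    ("DNase", "cell_type_B"): np.array([0.0, 0.1, 0.2, 0.1, 0.0, 0.1]),
}

results = deepwas_like_scan(genotypes, phenotype, effect_scores)
for unit, coefs in results.items():
    print(unit, coefs)
```

In this toy setup only SNPs 1 and 2 pass the filter, both in the first unit, and the second unit is skipped because no SNP reaches the threshold. Because each fitted coefficient belongs to a specific unit, an associated SNP comes with a candidate regulatory mechanism attached, which is the property the abstract highlights over post hoc annotation.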
Conflict of interest statement
The authors declare that no competing interests exist.