A phenome-wide association study identifies effects of copy-number variation of VNTRs and multicopy genes on multiple human traits
- PMID: 35609568
- PMCID: PMC9247821
- DOI: 10.1016/j.ajhg.2022.04.016
Abstract
The human genome contains tens of thousands of large tandem repeats and hundreds of genes that show common and highly variable copy-number changes. Due to their large size and repetitive nature, these variable number tandem repeats (VNTRs) and multicopy genes are generally recalcitrant to standard genotyping approaches and, as a result, this class of variation is poorly characterized. However, several recent studies have demonstrated that copy-number variation of VNTRs can modify local gene expression, epigenetics, and human traits, indicating that many have a functional role. Here, using read depth from whole-genome sequencing to profile copy number, we report results of a phenome-wide association study (PheWAS) of VNTRs and multicopy genes in a discovery cohort of ∼35,000 samples, identifying 32 traits associated with copy number of 38 VNTRs and multicopy genes at 1% FDR. We replicated many of these signals in an independent cohort and observed that VNTRs showing trait associations were significantly enriched for expression QTLs with nearby genes, providing strong support for our results. Fine-mapping studies indicated that in the majority (∼90%) of cases, the VNTRs and multicopy genes we identified represent the causal variants underlying the observed associations. Furthermore, several lie in regions where prior SNV-based GWASs have failed to identify any significant associations with these traits. Our study indicates that copy number of VNTRs and multicopy genes contributes to diverse human traits and suggests that complex structural variants potentially explain some of the so-called "missing heritability" of SNV-based GWASs.
Keywords: CNV; GWAS; read depth; tandem repeat; variable number tandem repeat.
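The abstract reports associations "at 1% FDR" but does not say which multiple-testing procedure was used. As an illustration only, the sketch below assumes the widely used Benjamini-Hochberg step-up procedure; the function name `benjamini_hochberg` and the example p-values are hypothetical and are not taken from the study.

```python
def benjamini_hochberg(pvalues, alpha=0.01):
    """Flag which tests pass a Benjamini-Hochberg FDR threshold.

    Assumption: BH is used here only as a common way to control FDR;
    the abstract does not name the procedure the authors applied.
    Returns a list of booleans in the same order as `pvalues`.
    """
    m = len(pvalues)
    # Indices of the p-values, sorted from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: pvalues[i])

    # Step-up rule: find the largest rank k with p_(k) <= (k / m) * alpha.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank / m * alpha:
            max_k = rank

    # Every test ranked at or below max_k is declared significant,
    # even if its own p-value exceeded its own rank threshold.
    significant = [False] * m
    for idx in order[:max_k]:
        significant[idx] = True
    return significant


if __name__ == "__main__":
    # Hypothetical p-values for five trait/locus association tests.
    example = [0.5, 0.0001, 0.03, 0.003, 0.019]
    print(benjamini_hochberg(example, alpha=0.01))
```

In a PheWAS, each p-value would correspond to one trait-by-locus test, so `m` is the total number of such tests and the threshold tightens as more pairs are tested.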
Copyright © 2022 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of interests: The authors declare no competing interests.