CNest: A novel copy number association discovery method uncovers 862 new associations from 200,629 whole-exome sequence datasets in the UK Biobank
- PMID: 36779085
- PMCID: PMC9903682
- DOI: 10.1016/j.xgen.2022.100167
Abstract
Copy number variation (CNV) is known to influence human traits and has a rich history of research in common and rare genetic disease. Although CNV is accepted as an important class of genomic variation, progress on copy-number-based genome-wide association studies (GWASs) from next-generation sequencing (NGS) data has been limited. Here we present a novel method for large-scale copy number analysis from NGS data that generates robust copy number estimates and allows copy number GWASs (CN-GWASs) to be performed genome-wide in discovery mode. We provide a detailed analysis in the UK Biobank resource and a specifically designed software package. We use these methods to perform CN-GWAS analysis across 78 human traits, discovering over 800 genetic associations that are likely to contribute strongly to trait distributions. Finally, we compare CNV and SNP association signals across the same traits and samples, defining specific CNV association classes.
Keywords: copy number variation; genome-wide association studies; next-generation sequencing; whole-exome sequencing.
© 2022 The Authors.
Conflict of interest statement
The authors declare no competing interests.
Similar articles
- An evaluation of copy number variation detection tools for cancer using whole exome sequencing data. BMC Bioinformatics. 2017 May 31;18(1):286. doi: 10.1186/s12859-017-1705-x. BMC Bioinformatics. 2017. PMID: 28569140 Free PMC article.
- cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate. Nucleic Acids Res. 2012 May;40(9):e69. doi: 10.1093/nar/gks003. Epub 2012 Feb 1. Nucleic Acids Res. 2012. PMID: 22302147 Free PMC article.
- The individual and global impact of copy-number variants on complex human traits. Am J Hum Genet. 2022 Apr 7;109(4):647-668. doi: 10.1016/j.ajhg.2022.02.010. Epub 2022 Mar 2. Am J Hum Genet. 2022. PMID: 35240056 Free PMC article.
- Using SAAS-CNV to Detect and Characterize Somatic Copy Number Alterations in Cancer Genomes from Next Generation Sequencing and SNP Array Data. Methods Mol Biol. 2018;1833:29-47. doi: 10.1007/978-1-4939-8666-8_2. Methods Mol Biol. 2018. PMID: 30039361 Review.
- Progress from genome-wide association studies and copy number variant studies in epilepsy. Curr Opin Neurol. 2016 Apr;29(2):158-67. doi: 10.1097/WCO.0000000000000296. Curr Opin Neurol. 2016. PMID: 26886358 Review.
Cited by
- Hidden protein-altering variants influence diverse human phenotypes. bioRxiv [Preprint]. 2023 Jun 9:2023.06.07.544066. doi: 10.1101/2023.06.07.544066. bioRxiv. 2023. Update in: Nat Genet. 2024 Apr;56(4):569-578. doi: 10.1038/s41588-024-01684-z. PMID: 37333244 Free PMC article. Updated. Preprint.
- Structural polymorphism and diversity of human segmental duplications. Nat Genet. 2025 Feb;57(2):390-401. doi: 10.1038/s41588-024-02051-8. Epub 2025 Jan 8. Nat Genet. 2025. PMID: 39779957 Free PMC article.
- GATK-gCNV enables the discovery of rare copy number variants from exome sequencing data. Nat Genet. 2023 Sep;55(9):1589-1597. doi: 10.1038/s41588-023-01449-0. Epub 2023 Aug 21. Nat Genet. 2023. PMID: 37604963 Free PMC article.
- Bridging Genomic Research Disparities in Osteoporosis GWAS: Insights for Diverse Populations. Curr Osteoporos Rep. 2025 May 24;23(1):24. doi: 10.1007/s11914-025-00917-2. Curr Osteoporos Rep. 2025. PMID: 40411668 Free PMC article. Review.
- The Influence of Trinucleotide Repeats in the Androgen Receptor Gene on Androgen-related Traits and Diseases. J Clin Endocrinol Metab. 2024 Nov 18;109(12):3234-3244. doi: 10.1210/clinem/dgae302. J Clin Endocrinol Metab. 2024. PMID: 38701087 Free PMC article.