CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing
- PMID: 27100738
- PMCID: PMC4839673
- DOI: 10.1371/journal.pcbi.1004873
CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing
Abstract
Germline copy number variants (CNVs) and somatic copy number alterations (SCNAs) are of significant importance in syndromic conditions and cancer. Massively parallel sequencing is increasingly used to infer copy number information from variations in the read depth in sequencing data. However, this approach has limitations in the case of targeted re-sequencing, which leaves gaps in coverage between the regions chosen for enrichment and introduces biases related to the efficiency of target capture and library preparation. We present a method for copy number detection, implemented in the software package CNVkit, that uses both the targeted reads and the nonspecifically captured off-target reads to infer copy number evenly across the genome. This combination achieves both exon-level resolution in targeted regions and sufficient resolution in the larger intronic and intergenic regions to identify copy number changes. In particular, we successfully inferred copy number at equivalent to 100-kilobase resolution genome-wide from a platform targeting as few as 293 genes. After normalizing read counts to a pooled reference, we evaluated and corrected for three sources of bias that explain most of the extraneous variability in the sequencing read depth: GC content, target footprint size and spacing, and repetitive sequences. We compared the performance of CNVkit to copy number changes identified by array comparative genomic hybridization. We packaged the components of CNVkit so that it is straightforward to use and provides visualizations, detailed reporting of significant features, and export options for integration into existing analysis pipelines. CNVkit is freely available from https://github.com/etal/cnvkit.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures







Similar articles
-
iCopyDAV: Integrated platform for copy number variations-Detection, annotation and visualization.PLoS One. 2018 Apr 5;13(4):e0195334. doi: 10.1371/journal.pone.0195334. eCollection 2018. PLoS One. 2018. PMID: 29621297 Free PMC article.
-
Identification and utilization of copy number information for correcting Hi-C contact map of cancer cell lines.BMC Bioinformatics. 2020 Nov 7;21(1):506. doi: 10.1186/s12859-020-03832-8. BMC Bioinformatics. 2020. PMID: 33160308 Free PMC article.
-
Evaluation of copy number variant detection from panel-based next-generation sequencing data.Mol Genet Genomic Med. 2019 Jan;7(1):e00513. doi: 10.1002/mgg3.513. Epub 2018 Nov 22. Mol Genet Genomic Med. 2019. PMID: 30565893 Free PMC article.
-
Deciphering new insights into copy number variations as drivers of genomic diversity and adaptation in farm animal species.Gene. 2025 Mar 5;939:149159. doi: 10.1016/j.gene.2024.149159. Epub 2024 Dec 11. Gene. 2025. PMID: 39672215 Review.
-
A survey of analysis software for array-comparative genomic hybridisation studies to detect copy number variation.Hum Genomics. 2010 Aug;4(6):421-7. doi: 10.1186/1479-7364-4-6-421. Hum Genomics. 2010. PMID: 20846932 Free PMC article. Review.
Cited by
-
Aneuploidy as a driver of human cancer.Nat Genet. 2024 Oct;56(10):2014-2026. doi: 10.1038/s41588-024-01916-2. Epub 2024 Oct 2. Nat Genet. 2024. PMID: 39358600 Review.
-
Identification of tumor-infiltrating lymphocyte subpopulations correlated with patient prognosis in esophageal squamous cell carcinoma.J Int Med Res. 2021 May;49(5):3000605211016206. doi: 10.1177/03000605211016206. J Int Med Res. 2021. PMID: 34044599 Free PMC article.
-
A novel genomic classification system of gastric cancer via integrating multidimensional genomic characteristics.Gastric Cancer. 2021 Nov;24(6):1227-1241. doi: 10.1007/s10120-021-01201-9. Epub 2021 Jun 6. Gastric Cancer. 2021. PMID: 34095982 Free PMC article.
-
Development and Analytical Validation of a Targeted Next-Generation Sequencing Panel to Detect Actionable Mutations for Targeted Therapy.Onco Targets Ther. 2021 Apr 7;14:2423-2431. doi: 10.2147/OTT.S299381. eCollection 2021. Onco Targets Ther. 2021. PMID: 33854338 Free PMC article.
-
Copy Number Variant Detection with Low-Coverage Whole-Genome Sequencing Represents a Viable Alternative to the Conventional Array-CGH.Diagnostics (Basel). 2021 Apr 15;11(4):708. doi: 10.3390/diagnostics11040708. Diagnostics (Basel). 2021. PMID: 33920867 Free PMC article.
References
-
- Dahl F, Stenberg J, Fredriksson S, Welch K, Zhang M, Nilsson M, et al. Multigene amplification and massively parallel sequencing for cancer mutation discovery. Proceedings of the National Academy of Sciences of the United States of America. 2007. May;104(22):9387–92. 10.1073/pnas.0702165104 - DOI - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous