DeepCNV: a deep learning approach for authenticating copy number variations
- PMID: 33429424
- PMCID: PMC8681111
- DOI: 10.1093/bib/bbaa381
DeepCNV: a deep learning approach for authenticating copy number variations
Abstract
Copy number variations (CNVs) are an important class of variations contributing to the pathogenesis of many disease phenotypes. Detecting CNVs from genomic data remains difficult, and the most currently applied methods suffer from an unacceptably high false positive rate. A common practice is to have human experts manually review original CNV calls for filtering false positives before further downstream analysis or experimental validation. Here, we propose DeepCNV, a deep learning-based tool, intended to replace human experts when validating CNV calls, focusing on the calls made by one of the most accurate CNV callers, PennCNV. The sophistication of the deep neural network algorithm is enriched with over 10 000 expert-scored samples that are split into training and testing sets. Variant confidence, especially for CNVs, is a main roadblock impeding the progress of linking CNVs with the disease. We show that DeepCNV adds to the confidence of the CNV calls with an optimal area under the receiver operating characteristic curve of 0.909, exceeding other machine learning methods. The superiority of DeepCNV was also benchmarked and confirmed using an experimental wet-lab validation dataset. We conclude that the improvement obtained by DeepCNV results in significantly fewer false positive results and failures to replicate the CNV association results.
Keywords: copy number variation; deep learning.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Figures







Similar articles
-
DL-CNV: A deep learning method for identifying copy number variations based on next generation target sequencing.Math Biosci Eng. 2019 Sep 30;17(1):202-215. doi: 10.3934/mbe.2020011. Math Biosci Eng. 2019. PMID: 31731347
-
Genome-wide algorithm for detecting CNV associations with diseases.BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331. BMC Bioinformatics. 2011. PMID: 21827692 Free PMC article.
-
A systematic benchmark of copy number variation detection tools for high density SNP genotyping arrays.Genomics. 2024 Nov;116(6):110962. doi: 10.1016/j.ygeno.2024.110962. Epub 2024 Nov 14. Genomics. 2024. PMID: 39547585
-
Assessing the reproducibility of exome copy number variations predictions.Genome Med. 2016 Aug 8;8(1):82. doi: 10.1186/s13073-016-0336-6. Genome Med. 2016. PMID: 27503473 Free PMC article.
-
[Copy number variations in the human genome: their mutational mechanisms and roles in diseases].Yi Chuan. 2011 Aug;33(8):857-69. doi: 10.3724/sp.j.1005.2011.00857. Yi Chuan. 2011. PMID: 21831802 Review. Chinese.
Cited by
-
Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure.Brief Bioinform. 2022 Mar 10;23(2):bbac043. doi: 10.1093/bib/bbac043. Brief Bioinform. 2022. PMID: 35211719 Free PMC article.
-
SVcnn: an accurate deep learning-based method for detecting structural variation based on long-read data.BMC Bioinformatics. 2023 May 23;24(1):213. doi: 10.1186/s12859-023-05324-x. BMC Bioinformatics. 2023. PMID: 37221476 Free PMC article.
-
Chromothripsis detection with multiple myeloma patients based on deep graph learning.Bioinformatics. 2023 Jul 1;39(7):btad422. doi: 10.1093/bioinformatics/btad422. Bioinformatics. 2023. PMID: 37399092 Free PMC article.
-
Artificial intelligence: A powerful paradigm for scientific research.Innovation (Camb). 2021 Oct 28;2(4):100179. doi: 10.1016/j.xinn.2021.100179. eCollection 2021 Nov 28. Innovation (Camb). 2021. PMID: 34877560 Free PMC article. Review.
-
Using Copy Number Variation Data and Neural Networks to Predict Cancer Metastasis Origin Achieves High Area under the Curve Value with a Trade-Off in Precision.Curr Issues Mol Biol. 2024 Aug 1;46(8):8301-8319. doi: 10.3390/cimb46080490. Curr Issues Mol Biol. 2024. PMID: 39194707 Free PMC article.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous