Identifying disease-associated copy number variations by a doubly penalized regression model
- PMID: 29894562
- PMCID: PMC6663092
- DOI: 10.1111/biom.12920
Identifying disease-associated copy number variations by a doubly penalized regression model
Abstract
Copy number variation (CNV) of DNA plays an important role in the development of many diseases. However, due to the irregularity and sparsity of the CNVs, studying the association between CNVs and a disease outcome or a trait can be challenging. Up to now, not many methods have been proposed in the literature for this problem. Most of the current researchers reply on an ad hoc two-stage procedure by first identifying CNVs in each individual genome and then performing an association test using these identified CNVs. This potentially leads to information loss and as a result a lower power to identify disease associated CNVs. In this article, we describe a new method that combines the two steps into a single coherent model to identify the common CNV across patients that are associated with certain diseases. We use a double penalty model to capture CNVs' association with both the intensities and the disease trait. We validate its performance in simulated datasets and a data example on platinum resistance and CNV in ovarian cancer genome.
Keywords: Association study; Copy number variation; Ovarian cancer; Penalized regression model.
© 2018, The International Biometric Society.
Figures





Similar articles
-
Genome-wide algorithm for detecting CNV associations with diseases.BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331. BMC Bioinformatics. 2011. PMID: 21827692 Free PMC article.
-
MCKAT: a multi-dimensional copy number variant kernel association test.BMC Bioinformatics. 2021 Dec 11;22(1):588. doi: 10.1186/s12859-021-04494-w. BMC Bioinformatics. 2021. PMID: 34895138 Free PMC article.
-
Noise cancellation using total variation for copy number variation detection.BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x. BMC Bioinformatics. 2018. PMID: 30343665 Free PMC article.
-
Copy number variations and human genetic disease.Curr Opin Pediatr. 2014 Dec;26(6):646-52. doi: 10.1097/MOP.0000000000000142. Curr Opin Pediatr. 2014. PMID: 25198053 Review.
-
Genomic copy number variations: A breakthrough in our knowledge on schizophrenia etiology?Neuro Endocrinol Lett. 2012;33(2):183-90. Neuro Endocrinol Lett. 2012. PMID: 22592199 Review.
Cited by
-
A Novel Computational Framework to Predict Disease-Related Copy Number Variations by Integrating Multiple Data Sources.Front Genet. 2021 Jun 29;12:696956. doi: 10.3389/fgene.2021.696956. eCollection 2021. Front Genet. 2021. PMID: 34267783 Free PMC article.
-
Supervised t-distributed stochastic neighbor embedding for data visualization and classification.INFORMS J Comput. 2021 Spring;33(2):419-835. doi: 10.1287/ijoc.2020.0961. Epub 2020 Sep 10. INFORMS J Comput. 2021. PMID: 34354339 Free PMC article.
-
Tissue-specific identification of multi-omics features for pan-cancer drug response prediction.iScience. 2022 Jul 19;25(8):104767. doi: 10.1016/j.isci.2022.104767. eCollection 2022 Aug 19. iScience. 2022. PMID: 35992090 Free PMC article.
References
-
- Alvarez AA, Lambers AR, Lancaster JM, Maxwell GL, Ali S, Gumbs C, et al. (2001). Allele Loss on Chromosome 1p36 in Epithelial Ovarian Cancers. Gynecologic Oncology 82, 94–98. - PubMed
-
- Benjamini Y, and Hochberg Y (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B 57 289–300.
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical