cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate
- PMID: 21486749
- PMCID: PMC3130288
- DOI: 10.1093/nar/gkr197
cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate
Abstract
Cost-effective oligonucleotide genotyping arrays like the Affymetrix SNP 6.0 are still the predominant technique to measure DNA copy number variations (CNVs). However, CNV detection methods for microarrays overestimate both the number and the size of CNV regions and, consequently, suffer from a high false discovery rate (FDR). A high FDR means that many CNVs are wrongly detected and therefore not associated with a disease in a clinical study, though correction for multiple testing takes them into account and thereby decreases the study's discovery power. For controlling the FDR, we propose a probabilistic latent variable model, 'cn.FARMS', which is optimized by a Bayesian maximum a posteriori approach. cn.FARMS controls the FDR through the information gain of the posterior over the prior. The prior represents the null hypothesis of copy number 2 for all samples from which the posterior can only deviate by strong and consistent signals in the data. On HapMap data, cn.FARMS clearly outperformed the two most prevalent methods with respect to sensitivity and FDR. The software cn.FARMS is publicly available as a R package at http://www.bioinf.jku.at/software/cnfarms/cnfarms.html.
Figures






Similar articles
-
cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate.Nucleic Acids Res. 2012 May;40(9):e69. doi: 10.1093/nar/gks003. Epub 2012 Feb 1. Nucleic Acids Res. 2012. PMID: 22302147 Free PMC article.
-
Algorithm implementation for CNV discovery using Affymetrix and Illumina SNP array data.Methods Mol Biol. 2012;838:291-310. doi: 10.1007/978-1-61779-507-7_14. Methods Mol Biol. 2012. PMID: 22228018
-
Evaluation of copy number variation detection for a SNP array platform.BMC Bioinformatics. 2014 Feb 21;15:50. doi: 10.1186/1471-2105-15-50. BMC Bioinformatics. 2014. PMID: 24555668 Free PMC article.
-
Comparing CNV detection methods for SNP arrays.Brief Funct Genomic Proteomic. 2009 Sep;8(5):353-66. doi: 10.1093/bfgp/elp017. Epub 2009 Sep 8. Brief Funct Genomic Proteomic. 2009. PMID: 19737800 Review.
-
Evaluation of the performance of copy number variant prediction tools for the detection of deletions from whole genome sequencing data.J Biomed Inform. 2019 Jun;94:103174. doi: 10.1016/j.jbi.2019.103174. Epub 2019 Apr 6. J Biomed Inform. 2019. PMID: 30965134 Review.
Cited by
-
Integrating genetics and epigenetics in breast cancer: biological insights, experimental, computational methods and therapeutic potential.BMC Syst Biol. 2015 Sep 21;9:62. doi: 10.1186/s12918-015-0211-x. BMC Syst Biol. 2015. PMID: 26391647 Free PMC article. Review.
-
DEXUS: identifying differential expression in RNA-Seq studies with unknown conditions.Nucleic Acids Res. 2013 Nov;41(21):e198. doi: 10.1093/nar/gkt834. Epub 2013 Sep 17. Nucleic Acids Res. 2013. PMID: 24049071 Free PMC article.
-
Identification and validation of copy number variants using SNP genotyping arrays from a large clinical cohort.BMC Genomics. 2012 Jun 15;13:241. doi: 10.1186/1471-2164-13-241. BMC Genomics. 2012. PMID: 22702538 Free PMC article.
-
Current analysis platforms and methods for detecting copy number variation.Physiol Genomics. 2013 Jan 7;45(1):1-16. doi: 10.1152/physiolgenomics.00082.2012. Epub 2012 Nov 6. Physiol Genomics. 2013. PMID: 23132758 Free PMC article. Review.
-
Live attenuated Rev-independent Nef¯SIV enhances acquisition of heterologous SIVsmE660 in acutely vaccinated rhesus macaques.PLoS One. 2013 Sep 30;8(9):e75556. doi: 10.1371/journal.pone.0075556. eCollection 2013. PLoS One. 2013. PMID: 24098702 Free PMC article.
References
-
- Jakobsson M, Scholz SW, Scheet P, Gibbs JR, VanLiere JM, Fung H-C, Szpiech ZA, Degnan JH, Wang K, Guerreiro R, et al. Genotype, haplotype and copy-number variation in worldwide human populations. Nature. 2008;451:998–1003. - PubMed
-
- Gonzalez E, Kulkarni H, Bolivar H, Mangano A, Sanchez R, Catano G, Nibbs RJ, Freedman BI, Quinones MP, Bamshad MJ, et al. The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility. Science. 2005;307:1434–1440. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases