A weighted sample size for microarray datasets that considers the variability of variance and multiplicity
- PMID: 19664562
- DOI: 10.1016/j.jbiosc.2009.03.017
Abstract
Microarray experiments are often performed to detect differentially expressed genes among clinical phenotypes. Sample size calculation for this purpose differs from that used for general clinical experiments, because a microarray measures tens of thousands of genes. We propose a sample size calculation method that considers the variance across the entire gene set and uses the Bonferroni correction to address the multiplicity problem. Specifically, the existing sample size equation was modified according to the Bonferroni correction. The variances of all genes were divided by k-means cluster analysis into several groups with similar values; a sample size was then calculated for each group, and the group sample sizes were combined into a weighted average. The results show that the required sample size is related to the number of genes on a chip. The weighted sample size calculated by the proposed method preserved the Type I error for the selection of significant genes within a microarray data set.
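The abstract does not give the modified equation, so the following is only a minimal sketch of the three steps it names (Bonferroni adjustment, k-means grouping of variances, weighted averaging). It assumes the standard normal-approximation formula for a two-sided, two-group comparison, n = 2(z_{1-α/(2G)} + z_{1-β})² σ² / δ², where G is the number of genes, and it rounds each group's sample size up before averaging. The function names, the parameter `delta` (the mean difference to detect), the number of clusters, and the quantile-based initialisation are all illustrative assumptions, not the authors' implementation.

```python
import math
from statistics import NormalDist, fmean


def bonferroni_sample_size(variance, delta, n_genes, alpha=0.05, power=0.8):
    """Per-group n for a two-sided two-sample z-test at level alpha / n_genes.

    Assumed textbook formula; the paper's exact equation is not in the abstract.
    """
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / (2 * n_genes))  # Bonferroni-adjusted quantile
    z_beta = z.inv_cdf(power)
    return 2 * (z_alpha + z_beta) ** 2 * variance / delta ** 2


def kmeans_1d(values, k, iters=100):
    """Plain Lloyd's k-means on scalars with deterministic quantile starts."""
    ordered = sorted(values)
    centers = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    clusters = []
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centers[j]))
            clusters[nearest].append(v)
        # Keep the old center if a cluster ends up empty.
        new_centers = [fmean(c) if c else centers[j] for j, c in enumerate(clusters)]
        if new_centers == centers:
            break
        centers = new_centers
    return clusters


def weighted_sample_size(variances, delta, k=3, alpha=0.05, power=0.8):
    """Average the per-cluster sample sizes, weighted by cluster size."""
    n_genes = len(variances)
    total = 0.0
    for members in kmeans_1d(variances, k):
        if not members:
            continue
        # Each group's n is rounded up to a whole number of samples (assumption).
        n_group = math.ceil(
            bonferroni_sample_size(fmean(members), delta, n_genes, alpha, power)
        )
        total += n_group * len(members) / n_genes
    return math.ceil(total)
```

Calling `weighted_sample_size(gene_variances, delta=1.0, k=3)` on a list of per-gene variances returns a single per-group sample size. Because the adjusted quantile grows as `n_genes` grows, the result increases with the number of genes on the chip, in line with the abstract's finding.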