Statistical evaluation of SAGE libraries: consequences for experimental design
- PMID: 12407185
- DOI: 10.1152/physiolgenomics.00042.2002
Statistical evaluation of SAGE libraries: consequences for experimental design
Abstract
Since the introduction of serial analysis of gene expression (SAGE) as a method to quantitatively analyze the differential expression of genes, several statistical tests have been published for the pairwise comparison of SAGE libraries. Testing the difference between the number of specific tags found in two SAGE libraries is hampered by the fact that each SAGE library is only one measurement: the necessary information on biological variation or experimental precision is not available. In the currently available tests, a measure of this variance is obtained from simulation or based on the properties of the tag distribution. To help the user of SAGE to decide between these tests, five different pairwise tests have been compared by determining the critical values, that is, the lowest number of tags that, given an observed number of tags in one library, needs to be found in the other library to result in a significant P value. The five tests included in this comparison are SAGE300, the tests described by Madden et al. (Oncogene 15: 1079-1085, 1997) and by Audic and Claverie (Genome Res 7: 986-995, 1997), Fisher's Exact test, and the Z test, which is equivalent to the chi-squared test. The comparison showed that, for SAGE libraries of equal as well as different size, SAGE300, Fisher's Exact test, Z test, and the Audic and Claverie test have critical values within 1.5% of each other. This indicates that these four tests will give essentially the same results when applied to SAGE libraries. The Madden test, which can only be used for libraries of similar size, is, with 25% higher critical values, more conservative, probably because the variance measure in its test statistic is not appropriate for hypothesis testing. The consequences for the choice of SAGE library sizes are discussed.
Similar articles
-
Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries.BMC Bioinformatics. 2004 Oct 6;5:143. doi: 10.1186/1471-2105-5-143. BMC Bioinformatics. 2004. PMID: 15469608 Free PMC article.
-
[Transcriptomes for serial analysis of gene expression].J Soc Biol. 2002;196(4):303-7. J Soc Biol. 2002. PMID: 12645300 Review. French.
-
Statistical modeling of sequencing errors in SAGE libraries.Bioinformatics. 2004 Aug 4;20 Suppl 1:i31-9. doi: 10.1093/bioinformatics/bth924. Bioinformatics. 2004. PMID: 15262778
-
LyM: a tool to reach the best factor in gene expression comparison.In Silico Biol. 2007;7(1):101-4. In Silico Biol. 2007. PMID: 17688434
-
[Gene expression profiling using improved SAGE].Rinsho Byori. 2002 Jan;50(1):52-60. Rinsho Byori. 2002. PMID: 11871137 Review. Japanese.
Cited by
-
Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach.BMC Bioinformatics. 2005 Jun 29;6:165. doi: 10.1186/1471-2105-6-165. BMC Bioinformatics. 2005. PMID: 15987513 Free PMC article.
-
Searching for molecular markers in head and neck squamous cell carcinomas (HNSCC) by statistical and bioinformatic analysis of larynx-derived SAGE libraries.BMC Med Genomics. 2008 Nov 11;1:56. doi: 10.1186/1755-8794-1-56. BMC Med Genomics. 2008. PMID: 19014460 Free PMC article.
-
Evaluation of the chicken transcriptome by SAGE of B cells and the DT40 cell line.BMC Genomics. 2004 Dec 21;5:98. doi: 10.1186/1471-2164-5-98. BMC Genomics. 2004. PMID: 15610564 Free PMC article.
-
Deep Sequencing of Suppression Subtractive Hybridisation Drought and Recovery Libraries of the Non-model Crop Trifolium repens L.Front Plant Sci. 2017 Feb 23;8:213. doi: 10.3389/fpls.2017.00213. eCollection 2017. Front Plant Sci. 2017. PMID: 28280499 Free PMC article.
-
Overdispersed logistic regression for SAGE: modelling multiple groups and covariates.BMC Bioinformatics. 2004 Oct 6;5:144. doi: 10.1186/1471-2105-5-144. BMC Bioinformatics. 2004. PMID: 15469612 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources