Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Mar 6:9:142.
doi: 10.1186/1471-2105-9-142.

Ranking analysis of F-statistics for microarray data

Affiliations

Ranking analysis of F-statistics for microarray data

Yuan-De Tan et al. BMC Bioinformatics. .

Abstract

Background: Microarray technology provides an efficient means for globally exploring physiological processes governed by the coordinated expression of multiple genes. However, identification of genes differentially expressed in microarray experiments is challenging because of their potentially high type I error rate. Methods for large-scale statistical analyses have been developed but most of them are applicable to two-sample or two-condition data.

Results: We developed a large-scale multiple-group F-test based method, named ranking analysis of F-statistics (RAF), which is an extension of ranking analysis of microarray data (RAM) for two-sample t-test. In this method, we proposed a novel random splitting approach to generate the null distribution instead of using permutation, which may not be appropriate for microarray data. We also implemented a two-simulation strategy to estimate the false discovery rate. Simulation results suggested that it has higher efficiency in finding differentially expressed genes among multiple classes at a lower false discovery rate than some commonly used methods. By applying our method to the experimental data, we found 107 genes having significantly differential expressions among 4 treatments at <0.7% FDR, of which 31 belong to the expressed sequence tags (ESTs), 76 are unique genes who have known functions in the brain or central nervous system and belong to six major functional groups.

Conclusion: Our method is suitable to identify differentially expressed genes among multiple groups, in particular, when sample size is small.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Profile of estimates of FDRs for a series of thresholds. λ1k and λ2k are two threshold functions from simulations 1 and 2 and were used to construct an estimation interval for estimate of FDR at threshold Δk. λk and λˆk are true and estimated FDRs at threshold Δk, respectively, where k = 1, 2,...,L.
Figure 2
Figure 2
The dot-plot of F-values versus f¯-values. F-values and f¯-values obtained from the simulated microarray data of 3770 genes were ranked where f¯-values were yielded by the random splitting approach. All ranked F - f¯ dots roughly fall on a diagonal line as expected by two sets of the same ranked distributions.
Figure 3
Figure 3
The scatter plot of F-values versus f¯-values. F-values were observed from real microarray data set and f¯-values yielded by random splitting approach are an estimate of null f-distribution.
Figure 4
Figure 4
Comparison between the observed (red) and simulated (blue) plots of F-values versus f¯-values. F-values were observed from real (red) and simulated (blue) microarray data sets of 3770 genes and 6 replicates. f¯-value yielded by randomly splitting approach is an estimate in null f-distribution. F-distribution from simulated data set without treatment effects is null distribution. Ranked F-values corresponds to ranked f¯-values.

Similar articles

Cited by

References

    1. DeRisi J, Penland L, Brown PO, Bittner ML, Meltzer PS, Ray M, Chen Y, Su YA, Trent JM. Use of a cDNA microarray to analyse gene expression patterns in human cancer. Nature genetics. 1996;14:457–460. doi: 10.1038/ng1296-457. - DOI - PubMed
    1. DeRisi JL, Iyer VR, Brown PO. Exploring the metabolic and genetic control of gene expression on a genomic scale. Science. 1997;278:680–686. doi: 10.1126/science.278.5338.680. - DOI - PubMed
    1. Kim YD, Sohn NW, Kang C, Soh Y. DNA array reveals altered gene expression in response to focal cerebral ischemia. Brain research bulletin. 2002;58:491–498. doi: 10.1016/S0361-9230(02)00823-7. - DOI - PubMed
    1. Kerr MK, Martin M, Churchill GA. Analysis of variance for gene expression microarray data. J Comput Biol. 2000;7:819–837. doi: 10.1089/10665270050514954. - DOI - PubMed
    1. Holm S. A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics. 1979;6:65–70.

Publication types

MeSH terms