Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009:2009:587405.
doi: 10.1155/2009/587405. Epub 2010 Jan 10.

Simpler evaluation of predictions and signature stability for gene expression data

Affiliations

Simpler evaluation of predictions and signature stability for gene expression data

Yvonne E Pittelkow et al. J Biomed Biotechnol. 2009.

Abstract

Scientific advances are raising expectations that patient-tailored treatment will soon be available. The development of resulting clinical approaches needs to be based on well-designed experimental and observational procedures that provide data to which proper biostatistical analyses are applied. Gene expression microarray and related technology are rapidly evolving. It is providing extremely large gene expression profiles containing many thousands of measurements. Choosing a subset from these gene expression measurements to include in a gene expression signature is one of the many challenges needing to be met. Choice of this signature depends on many factors, including the selection of patients in the training set. So the reliability and reproducibility of the resultant prognostic gene signature needs to be evaluated, in such a way as to be relevant to the clinical setting. A relatively straightforward approach is based on cross validation, with separate selection of genes at each iteration to avoid selection bias. Within this approach we developed two different methods, one based on forward selection, the other on genes that were statistically significant in all training blocks of data. We demonstrate our approach to gene signature evaluation with a well-known breast cancer data set.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Performance assessment for method (ii). The pairs (TPF, FPF) estimated on the validation set are plotted in each graph. The assessment for the 59 gene signature (α1 ≤ .001) is shown on the upper and that for the 14 gene signature (α1 ≤ .0001) is shown on the lower.

Similar articles

Cited by

  • Systems biology and cancer: promises and perils.
    Baker SG, Kramer BS. Baker SG, et al. Prog Biophys Mol Biol. 2011 Aug;106(2):410-3. doi: 10.1016/j.pbiomolbio.2011.03.002. Epub 2011 Mar 23. Prog Biophys Mol Biol. 2011. PMID: 21419159 Free PMC article. Review.

References

    1. Schena M. Microarray Analysis. New York, NY, USA: Wiley-Liss; 2003.
    1. Michiels S, Koscielny S, Hill C. Prediction of cancer outcome with microarrays: a multiple random validation strategy. The Lancet. 2005;365(9458):488–492. - PubMed
    1. Baker SG, Kramer BS. Identifying genes that contribute most to good classification in microarrays. BMC Bioinformatics. 2006;7, article 407:1–7. - PMC - PubMed
    1. Ambroise C, McLachlan GJ. Selection bias in gene extraction on the basis of microarray gene-expression data. Proceedings of the National Academy of Sciences of the United States of America. 2002;99(10):6562–6566. - PMC - PubMed
    1. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York, NY, USA: Springer; 2001. (Springer Series in Statistics).

Publication types

MeSH terms