Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Apr;2(4):E108.
doi: 10.1371/journal.pbio.0020108. Epub 2004 Apr 13.

Semi-supervised methods to predict patient survival from gene expression data

Affiliations

Semi-supervised methods to predict patient survival from gene expression data

Eric Bair et al. PLoS Biol. 2004 Apr.

Abstract

An important goal of DNA microarray research is to develop tools to diagnose cancer more accurately based on the genetic profile of a tumor. There are several existing techniques in the literature for performing this type of diagnosis. Unfortunately, most of these techniques assume that different subtypes of cancer are already known to exist. Their utility is limited when such subtypes have not been previously identified. Although methods for identifying such subtypes exist, these methods do not work well for all datasets. It would be desirable to develop a procedure to find such subtypes that is applicable in a wide variety of circumstances. Even if no information is known about possible subtypes of a certain form of cancer, clinical information about the patients, such as their survival time, is often available. In this study, we develop some procedures that utilize both the gene expression data and the clinical data to identify subtypes of cancer and use this knowledge to diagnose future patients. These procedures were successfully applied to several publicly available datasets. We present diagnostic procedures that accurately predict the survival of future patients based on the gene expression profile and survival times of previous patients. This has the potential to be a powerful tool for diagnosing and treating cancer.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no conflicts of interest exist.

Figures

Figure 1
Figure 1. Two Patient Subgroups with Overlapping Survival Times
Figure 2
Figure 2. Comparison of the Survival Curves of the “Low-Risk” and “High-Risk” Groups
These were obtained by applying nearest shrunken centroids to the DLBCL test data. Patients in the training data were assigned to either the “low-risk” or “high-risk” group depending on whether or not their survival time was greater than the median survival time of all the patients.
Figure 3
Figure 3. Comparison of the Survival Curves Resulting from Applying Two Different Clustering Methods to the DLBCL Data
Figure 4
Figure 4. Comparison of the Survival Curves Resulting from Applying Two Different Clustering Methods to the DLBCL Data
Figure 5
Figure 5. Survival Curves for Clusters Derived from the DLBCL Data
Figure 6
Figure 6. Plot of Survival Versus the Predictor υ^I for the DLBCL Data

Similar articles

Cited by

References

    1. Alizadeh AA, Eisen MB, Davis RE, Ma C, Lossos IS, et al. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature. 2000;403:503–511. - PubMed
    1. Beer DG, Kardia SL, Huang CC, Giordano TJ, Levin AM, et al. Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med. 2002;8:816–824. - PubMed
    1. Ben-Dor A, Friedman N, Yakhini Z. Montreal (Quebec): ACM Press; 2001. Class discovery in gene expression data. Proceedings of the fifth annual international conference on computational biology; pp. 31–38. - DOI
    1. Bhattacharjee A, Richards WG, Staunton J, Li C, Monti S, et al. Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. Proc Natl Acad Sci U S A. 2001;98:13790–13795. - PMC - PubMed
    1. Bullinger L, Döhner K, Bair E, Fröhling S, Schlenk R, et al. Gene expression profiling identifies new subclasses and improves outcome prediction in adult myeloid leukemia. N Engl J Med. 2004 in press. - PubMed

Publication types