Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Feb 4:9:15-30.
doi: 10.4137/cin.s3794.

A robust gene selection method for microarray-based cancer classification

Affiliations

A robust gene selection method for microarray-based cancer classification

Xiaosheng Wang et al. Cancer Inform. .

Abstract

Gene selection is of vital importance in molecular classification of cancer using high-dimensional gene expression data. Because of the distinct characteristics inherent to specific cancerous gene expression profiles, developing flexible and robust feature selection methods is extremely crucial. We investigated the properties of one feature selection approach proposed in our previous work, which was the generalization of the feature selection method based on the depended degree of attribute in rough sets. We compared the feature selection method with the established methods: the depended degree, chi-square, information gain, Relief-F and symmetric uncertainty, and analyzed its properties through a series of classification experiments. The results revealed that our method was superior to the canonical depended degree of attribute based method in robustness and applicability. Moreover, the method was comparable to the other four commonly used methods. More importantly, the method can exhibit the inherent classification difficulty with respect to different gene expression datasets, indicating the inherent biology of specific cancers.

Keywords: cancer classification; dependent degree; feature selection; machine learning; microarrays; rough sets.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Best classification accuracy of each classifier.
Figure 2.
Figure 2.
Average classification accuracy of each classifier.
Figure 3.
Figure 3.
Average classification accuracy for different α values.
Figure 4.
Figure 4.
Best classification accuracy for different α values.
Figure 5.
Figure 5.
Relationship between AHA and α.
Figure 6.
Figure 6.
Relationship between ABAA and α.
Figure 7.
Figure 7.
Average accuracy under each α value in Colon tumor.
Figure 8.
Figure 8.
Best accuracy under each α value in Colon tumor.
Figure 9.
Figure 9.
Contrast in average accuracy for different feature selection methods.
Figure 10.
Figure 10.
Contrast in best accuracy for different feature selection methods.

Similar articles

Cited by

References

    1. Golub TR, Slonim DK, Tamayo P, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science. 1999;286(5439):531–7. - PubMed
    1. Wang X, Gotoh O. Microarray-Based Cancer Prediction Using Soft Computing Approach. Cancer Informatics. 2009;7:123–39. - PMC - PubMed
    1. Li J, Wong L. Identifying good diagnostic gene groups from gene expression profiles using the concept of emerging patterns. Bioinformatics. 2002;18(5):725–34. - PubMed
    1. Geman D, d’Avignon C, Naiman DQ, Winslow RL.Classifying gene expression profiles from pairwise mRNA comparisons Stat Appl Genet Mol Biol 20043:Article 19. - PMC - PubMed
    1. Alon U, Barkai N, Notterman DA, et al. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci U S A. 1999;96(12):6745–50. - PMC - PubMed

LinkOut - more resources