A new method for class prediction based on signed-rank algorithms applied to Affymetrix microarray experiments
- PMID: 18190711
- PMCID: PMC2248160
- DOI: 10.1186/1471-2105-9-16
A new method for class prediction based on signed-rank algorithms applied to Affymetrix microarray experiments
Abstract
Background: The huge amount of data generated by DNA chips is a powerful basis to classify various pathologies. However, constant evolution of microarray technology makes it difficult to mix data from different chip types for class prediction of limited sample populations. Affymetrix(R) technology provides both a quantitative fluorescence signal and a decision (detection call: absent or present) based on signed-rank algorithms applied to several hybridization repeats of each gene, with a per-chip normalization. We developed a new prediction method for class belonging based on the detection call only from recent Affymetrix chip type. Biological data were obtained by hybridization on U133A, U133B and U133Plus 2.0 microarrays of purified normal B cells and cells from three independent groups of multiple myeloma (MM) patients.
Results: After a call-based data reduction step to filter out non class-discriminative probe sets, the gene list obtained was reduced to a predictor with correction for multiple testing by iterative deletion of probe sets that sequentially improve inter-class comparisons and their significance. The error rate of the method was determined using leave-one-out and 5-fold cross-validation. It was successfully applied to (i) determine a sex predictor with the normal donor group classifying gender with no error in all patient groups except for male MM samples with a Y chromosome deletion, (ii) predict the immunoglobulin light and heavy chains expressed by the malignant myeloma clones of the validation group and (iii) predict sex, light and heavy chain nature for every new patient. Finally, this method was shown powerful when compared to the popular classification method Prediction Analysis of Microarray (PAM).
Conclusion: This normalization-free method is routinely used for quality control and correction of collection errors in patient reports to clinicians. It can be easily extended to multiple class prediction suitable with clinical groups, and looks particularly promising through international cooperative projects like the "Microarray Quality Control project of US FDA" MAQC as a predictive classifier for diagnostic, prognostic and response to treatment. Finally, it can be used as a powerful tool to mine published data generated on Affymetrix systems and more generally classify samples with binary feature values.
Figures

Similar articles
-
Optimized between-group classification: a new jackknife-based gene selection procedure for genome-wide expression data.BMC Bioinformatics. 2005 Sep 28;6:239. doi: 10.1186/1471-2105-6-239. BMC Bioinformatics. 2005. PMID: 16191195 Free PMC article.
-
An alternative method to amplify RNA without loss of signal conservation for expression analysis with a proteinase DNA microarray in the ArrayTube format.BMC Genomics. 2006 Jun 12;7:144. doi: 10.1186/1471-2164-7-144. BMC Genomics. 2006. PMID: 16768788 Free PMC article.
-
Cross-generation and cross-laboratory predictions of Affymetrix microarrays by rank-based methods.J Biomed Inform. 2008 Aug;41(4):570-9. doi: 10.1016/j.jbi.2007.11.005. Epub 2007 Dec 4. J Biomed Inform. 2008. PMID: 18234562
-
Classification based upon gene expression data: bias and precision of error rates.Bioinformatics. 2007 Jun 1;23(11):1363-70. doi: 10.1093/bioinformatics/btm117. Epub 2007 Mar 28. Bioinformatics. 2007. PMID: 17392326 Review.
-
Reproducible and reliable microarray results through quality control: good laboratory proficiency and appropriate data analysis practices are essential.Curr Opin Biotechnol. 2008 Feb;19(1):10-8. doi: 10.1016/j.copbio.2007.11.003. Epub 2007 Dec 26. Curr Opin Biotechnol. 2008. PMID: 18155896 Review.
Cited by
-
In vivo treatment with epigenetic modulating agents induces transcriptional alterations associated with prognosis and immunomodulation in multiple myeloma.Oncotarget. 2015 Feb 20;6(5):3319-34. doi: 10.18632/oncotarget.3207. Oncotarget. 2015. PMID: 25669970 Free PMC article.
-
Insulin is a potent myeloma cell growth factor through insulin/IGF-1 hybrid receptor activation.Leukemia. 2010 Nov;24(11):1940-50. doi: 10.1038/leu.2010.192. Epub 2010 Sep 16. Leukemia. 2010. PMID: 20844560 Free PMC article.
-
Microarray Detection Call Methodology as a Means to Identify and Compare Transcripts Expressed within Syncytial Cells from Soybean (Glycine max) Roots Undergoing Resistant and Susceptible Reactions to the Soybean Cyst Nematode (Heterodera glycines).J Biomed Biotechnol. 2010;2010:491217. doi: 10.1155/2010/491217. Epub 2010 May 19. J Biomed Biotechnol. 2010. PMID: 20508855 Free PMC article.
-
Gene expression-based prediction of myeloma cell sensitivity to histone deacetylase inhibitors.Br J Cancer. 2013 Aug 6;109(3):676-85. doi: 10.1038/bjc.2013.392. Epub 2013 Jul 18. Br J Cancer. 2013. PMID: 23868005 Free PMC article.
-
Detection call algorithms for high-throughput gene expression microarray data.Brief Bioinform. 2010 Mar;11(2):244-52. doi: 10.1093/bib/bbp055. Epub 2009 Nov 25. Brief Bioinform. 2010. PMID: 19939941 Free PMC article. Review.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous