Generalizable mass spectrometry mining used to identify disease state biomarkers from blood serum
- PMID: 12973730
- DOI: 10.1002/pmic.200300516
Generalizable mass spectrometry mining used to identify disease state biomarkers from blood serum
Abstract
We bring a "spectrum" of classical data mining and statistical analysis methods to bear on discrimination of two groups of spectra from 24 diseased and 17 normal patients. Our primary goal is to accurately estimate the generalizability of this small dataset. After an aggressive preprocessing step that reduces consideration to only 55 peaks, we conduct over 35 out-of-sample cross-validation simulations of logistic regression, binary decision trees, and linear discriminant analysis. Misclassification rates grow worse as the size of the holdout sample increases, with many exceeding 30 percent. The ability to generalize is clearly tempered by the statistical, instrumentation, and biophysical characteristics of the study.
Similar articles
-
Protocols for disease classification from mass spectrometry data.Proteomics. 2003 Sep;3(9):1692-8. doi: 10.1002/pmic.200300519. Proteomics. 2003. PMID: 12973727
-
Identification and validation of a potential lung cancer serum biomarker detected by matrix-assisted laser desorption/ionization-time of flight spectra analysis.Proteomics. 2003 Sep;3(9):1720-4. doi: 10.1002/pmic.200300514. Proteomics. 2003. PMID: 12973732
-
Detection of lung cancer using plasma protein profiling by matrix-assisted laser desorption/ionization mass spectrometry.Eur J Mass Spectrom (Chichester). 2010;16(4):539-49. doi: 10.1255/ejms.1080. Eur J Mass Spectrom (Chichester). 2010. PMID: 20625202
-
Identification of a 17-protein signature in the serum of lung cancer patients.Oncol Rep. 2010 Jul;24(1):263-70. doi: 10.3892/or_00000855. Oncol Rep. 2010. PMID: 20514471
-
The MALDI-TOF mass spectrometric view of the plasma proteome and peptidome.Clin Chem. 2006 Jul;52(7):1223-37. doi: 10.1373/clinchem.2006.069252. Epub 2006 Apr 27. Clin Chem. 2006. PMID: 16644871 Review.
Cited by
-
Molecular prognostic prediction for locally advanced nasopharyngeal carcinoma by support vector machine integrated approach.PLoS One. 2012;7(3):e31989. doi: 10.1371/journal.pone.0031989. Epub 2012 Mar 9. PLoS One. 2012. PMID: 22427815 Free PMC article. Clinical Trial.
-
Parametric power spectral density analysis of noise from instrumentation in MALDI TOF mass spectrometry.Cancer Inform. 2007 Sep 17;3:219-30. Cancer Inform. 2007. PMID: 19455245 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical