Decision tree classification of proteins identified by mass spectrometry of blood serum samples from people with and without lung cancer
- PMID: 12973724
- DOI: 10.1002/pmic.200300521
Abstract
A classification and regression tree (CART) model was trained to classify 41 clinical specimens as disease or nondisease based on 26 variables computed from the mass-to-charge ratios (m/z) and peak heights of proteins identified by mass spectrometry. The CART model built on all of the specimens (no cross-validation) had an error rate of 4/41, or about 10%. The model suggests that mass spectral peaks in the 8000-10,000, 20,000-30,000, 45,000-60,000, and >125,000 m/z ranges may be valuable in distinguishing disease from nondisease specimens. Under leave-one-out cross-validation, the area under the receiver operating characteristic curve was 0.80 ± 0.07.
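The evaluation described in the abstract, leave-one-out cross-validation scored by the area under the ROC curve, can be illustrated with a minimal sketch. It is not the authors' implementation: a single Gini-impurity split (a decision stump) stands in for a full CART tree, the data are randomly generated rather than taken from the study, and the class balance and all function names (`fit_stump`, `loo_scores`, `auc`, `make_synthetic`) are assumptions made for illustration. Only the specimen count (41) and variable count (26) come from the abstract.

```python
import random


def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def fit_stump(X, y):
    """Find the one (feature, threshold) split with the lowest weighted Gini impurity.

    Returns (feature_index, threshold, p_left, p_right), where p_left and
    p_right are the fractions of label-1 specimens on each side of the split.
    """
    n = len(y)
    best = None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        # Candidate thresholds are midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [y[i] for i in range(n) if X[i][j] <= t]
            right = [y[i] for i in range(n) if X[i][j] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[0]:
                best = (score, j, t, sum(left) / len(left), sum(right) / len(right))
    return best[1:]


def predict_score(stump, row):
    """Score a specimen by the label-1 fraction of the leaf it falls into."""
    j, t, p_left, p_right = stump
    return p_left if row[j] <= t else p_right


def loo_scores(X, y):
    """Leave-one-out: refit on all specimens but one, then score the held-out one."""
    scores = []
    for i in range(len(y)):
        stump = fit_stump(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        scores.append(predict_score(stump, X[i]))
    return scores


def auc(y, scores):
    """ROC area as the Mann-Whitney statistic; tied scores count one half."""
    pos = [s for s, label in zip(scores, y) if label == 1]
    neg = [s for s, label in zip(scores, y) if label == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def make_synthetic(n=41, n_features=26, seed=0):
    """Synthetic stand-in data; class balance and signal are NOT from the study."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n)]
    X = []
    for label in y:
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        row[0] += 2.0 * label  # one informative variable, purely illustrative
        X.append(row)
    return X, y


X, y = make_synthetic()
held_out_scores = loo_scores(X, y)
print("leave-one-out AUC on synthetic data:", round(auc(y, held_out_scores), 2))
```

Because each held-out specimen is scored by a model that never saw it, the resulting AUC is a less optimistic estimate than an error rate computed on the same specimens used to build the tree, which is the distinction the abstract draws between its two reported figures.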