Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Jul 17:18:2012-2025.
doi: 10.1016/j.csbj.2020.07.009. eCollection 2020.

Computational advances of tumor marker selection and sample classification in cancer proteomics

Affiliations
Review

Computational advances of tumor marker selection and sample classification in cancer proteomics

Jing Tang et al. Comput Struct Biotechnol J. .

Abstract

Cancer proteomics has become a powerful technique for characterizing the protein markers driving transformation of malignancy, tracing proteome variation triggered by therapeutics, and discovering the novel targets and drugs for the treatment of oncologic diseases. To facilitate cancer diagnosis/prognosis and accelerate drug target discovery, a variety of methods for tumor marker identification and sample classification have been developed and successfully applied to cancer proteomic studies. This review article describes the most recent advances in those various approaches together with their current applications in cancer-related studies. Firstly, a number of popular feature selection methods are overviewed with objective evaluation on their advantages and disadvantages. Secondly, these methods are grouped into three major classes based on their underlying algorithms. Finally, a variety of sample separation algorithms are discussed. This review provides a comprehensive overview of the advances on tumor maker identification and patients/samples/tissues separations, which could be guidance to the researches in cancer proteomics.

Keywords: ANN, Artificial Neural Network; ANOVA, Analysis of Variance; CFS, Correlation-based Feature Selection; Cancer proteomics; Computational methods; DAPC, Discriminant Analysis of Principal Component; DT, Decision Trees; EDA, Estimation of Distribution Algorithm; FC, Fold Change; GA, Genetic Algorithms; GR, Gain Ratio; HC, Hill Climbing; HCA, Hierarchical Cluster Analysis; IG, Information Gain; LDA, Linear Discriminant Analysis; LIMMA, Linear Models for Microarray Data; MBF, Markov Blanket Filter; MWW, Mann–Whitney–Wilcoxon test; OPLS-DA, Orthogonal Partial Least Squares Discriminant Analysis; PCA, Principal Component Analysis; PLS-DA, Partial Least Square Discriminant Analysis; RF, Random Forest; RF-RFE, Random Forest with Recursive Feature Elimination; SA, Simulated Annealing; SAM, Significance Analysis of Microarrays; SBE, Sequential Backward Elimination; SFS, and Sequential Forward Selection; SOM, Self-organizing Map; SU, Symmetrical Uncertainty; SVM, Support Vector Machine; SVM-RFE, Support Vector Machine with Recursive Feature Elimination; Sample classification; Tumor marker selection; sPLSDA, Sparse Partial Least Squares Discriminant Analysis; t-SNE, Student t Distribution; χ2, Chi-square.

PubMed Disclaimer

Figures

None
Graphical abstract
Fig. 1
Fig. 1
The workflow of cancer biomarker discovery in quantitative proteomics study.
Fig. 2
Fig. 2
Classification of available feature selection algorithms in cancer proteomic study.
Fig. 3
Fig. 3
Feature selections applied in cancer proteomics and popular sample classification methods.

Similar articles

Cited by

References

    1. Malvezzi M., Carioli G., Bertuccio P., Negri E., La Vecchia C. Relation between mortality trends of cardiovascular diseases and selected cancers in the European Union, in 1970–2017. Focus on cohort and period effects. Eur J Cancer. 2018;103:341–355. - PubMed
    1. Arora D., Chaudhary R., Singh A. System biology approach to identify potential receptor for targeting cancer and biomolecular interaction studies of indole[2,1-a]isoquinoline derivative as anticancerous drug candidate against it. Interdiscip Sci Comput Life Sci. 2017;11:125–134. - PubMed
    1. Reddy M.M., Kar S.S. Unconditional probability of dying and age-specific mortality rate because of major non-communicable diseases in India: time trends from 2001 to 2013. J Postgrad Med. 2018;65:11. - PMC - PubMed
    1. Guo Y., Wang H., Li Y., Song Y., Chen C., Liao Y. Genome of Helicobacter pylori strain XZ274, an isolate from a tibetan patient with gastric cancer in China. J Bacteriol. 2012;194:4146–4147. - PMC - PubMed
    1. Fu J., Tang J., Wang Y., Cui X., Yang Q., Hong J. Discovery of the consistently well-performed analysis chain for SWATH-MS based pharmacoproteomic quantification. Front Pharmacol. 2018;9:681. - PMC - PubMed

LinkOut - more resources