Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019;46(16):2987-3007.
doi: 10.1080/02664763.2019.1625876. Epub 2019 Jun 7.

SVM-CART for Disease Classification

Affiliations

SVM-CART for Disease Classification

Evan Reynolds et al. J Appl Stat. 2019.

Abstract

Classification and regression trees (CART) and support vector machines (SVM) have become very popular statistical learning tools for analyzing complex data that often arise in biomedical research. While both CART and SVM serve as powerful classifiers in many clinical settings, there are some common scenarios in which each fails to meet the performance and interpretability needed for use as a clinical decision-making tool. In this paper, we propose a new classification method, SVM-CART, that combines features of SVM and CART to produce a more flexible classifier that has the potential to outperform either method in terms of interpretability and prediction accuracy. Further-more, to enhance prediction accuracy we provide extensions of a single SVM-CART to an ensemble, and methods to extract a representative classifier from the SVM-CART ensemble. The goal is to produce a decision-making tool that can be used in the clinical setting, while still harnessing the stability and predictive improvements gained through developing the SVM-CART ensemble. An extensive simulation study is conducted to asses the performance of the methods in various settings. Finally, we illustrate our methods using a clinical neuropathy dataset.

Keywords: Classification and Regression Trees; Complex Interactions; Ensemble Classifiers; Statistical Learning; Support Vector Machines.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Results from SVM and CART for Simulated Scenarios
Figure 2:
Figure 2:
SVM-CART for Simulated Scenarios
Figure 3:
Figure 3:
Data Simulated from Underlying SVM-CART Like Structure
Figure 4:
Figure 4:
% Correct Classification Percentage for Tuning Parameter Selection
Figure 5:
Figure 5:
Single SVM-CART Classifier for Neuropathy
Figure 6:
Figure 6:
Most Representative SVM-CART Classifier from the Ensemble

Similar articles

Cited by

References

    1. Breiman L, Friedman JH, Olshen RA, and Stone CJ Classification and Regression Trees Belmont, California: Wadsworth; 1984.
    1. Zhang H and Singer B Recursive Partitioning in the Health Sciences Springer: New York: 1999.
    1. Hastie T, Tibshirani R and Friedman J The Elements of Statistical Learning Springer: New York: 2001.
    1. Cutler A, Cutler DR and Stevens JR “Tree-based methods”. High-Dimensional Data Analysis in Cancer Research, 2009; 24, 123–140.
    1. Banerjee M. “Tree-based model for thyroid cancer prognostication”. The Journal of Clinical Endocrinology & Metabolosim 2014;99(10),3737–3745. - PMC - PubMed

LinkOut - more resources