Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Apr 1:34:7.
doi: 10.18637/jss.v034.i07.

rpartOrdinal: An R Package for Deriving a Classification Tree for Predicting an Ordinal Response

Affiliations

rpartOrdinal: An R Package for Deriving a Classification Tree for Predicting an Ordinal Response

Kellie J Archer. J Stat Softw. .

Abstract

This paper describes an R package, rpartOrdinal, that implements alternative splitting functions for fitting a classification tree when interest lies in predicting an ordinal response. This includes the generalized Gini impurity function, which was introduced as a method for predicting an ordinal response by including costs of misclassification into the impurity function, as well as an alternative ordinal impurity function due to Piccarreta (2008) that does not require the assignment of misclassification costs. The ordered twoing splitting method, which is not defined as a decrease in node impurity, is also included in the package. Since, in the ordinal response setting, misclassifying observations to adjacent categories is a less egregious error than misclassifying observations to distant categories, this package also includes a function for estimating an ordinal measure of association, the gamma statistic.

PubMed Disclaimer

Figures

Figure 1
Figure 1
CT for low birthweight dataset using ordered twoing.
Figure 2
Figure 2
CT for low birthweight dataset using the ordinal impurity function.
Figure 3
Figure 3
CT for lowbwt using generalized Gini with linear cost of misclassification.
Figure 4
Figure 4
CT for lowbwt using generalized Gini with quadratic cost of misclassification.
Figure 5
Figure 5
CT for B-ALL using ordered twoing.
Figure 6
Figure 6
CT for B-ALL using the ordinal impurity function.
Figure 7
Figure 7
CT for B-ALL using generalized Gini with linear cost of misclassification.
Figure 8
Figure 8
CT for B-ALL using generalized Gini with quadratic cost of misclassification.

References

    1. Agresti AA. Categorical Data Analysis. 2nd edition John Wiley & Sons; Hoboken, NJ: 2002.
    1. Archer KJ, Mas VR. Ordinal Response Prediction Using Bootstrap Aggregation, with Application to a High-throughput Methylation Dataset. Statistics in Medicine. 2009;28:3597–3610. - PMC - PubMed
    1. Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and Regression Trees. Wadsworth Advanced Books and Software; Belmont, CA: 1984. Wadsworth Statistics/Probability Series.
    1. Chiaretti S, Li X, Gentleman R, Vitale A, Vignetti M, Mandelli F, Ritz J, Foá R. Gene Expression Profile of Adult T-cell Acute Lymphocytic Leukemia Identifies Distinct Subsets of Patients with Different Response to Therapy and Survival. Blood. 2004;103:2771–2778. - PubMed
    1. Chiaretti S, Li X, Gentleman R, Vitale A, Wang K, Mandelli F, Foá R, Ritz J. Gene Expression Profles of B-lineage Adult Lymphocytic Leukemia Reveal Genetic Patterns that Identify Lineage Derivation and Distinct Mechanisms of Transformation. Clinical Cancer Research. 2005;20:7209–7219. - PubMed

LinkOut - more resources