PLoS One. 2014 Jun 20;9(6):e100234.
doi: 10.1371/journal.pone.0100234. eCollection 2014.

Prediction of survival with alternative modeling techniques using pseudo values

Tjeerd van der Ploeg et al. PLoS One. 2014.

Abstract

Background: The use of alternative modeling techniques for predicting patient survival is complicated by the fact that some alternative techniques cannot readily deal with censoring, which is essential for analyzing survival data. In the current study, we aimed to demonstrate that pseudo values enable statistically appropriate analyses of survival outcomes when used in seven alternative modeling techniques.

Methods: In this case study, we analyzed survival of 1282 Dutch patients with newly diagnosed Head and Neck Squamous Cell Carcinoma (HNSCC) with conventional Kaplan-Meier and Cox regression analysis. We subsequently calculated pseudo values to reflect the individual survival patterns. We used these pseudo values to compare recursive partitioning (RPART), neural nets (NNET), logistic regression (LR), general linear models (GLM), and three variants of support vector machines (SVM) with respect to dichotomous 60-month survival, continuous pseudo values at 60 months, and estimated survival time. We compared the performance of these models using the area under the ROC curve (AUC) and the root mean squared error (RMSE), both assessed with bootstrap validation.
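To make the pseudo-value step concrete, the sketch below computes jackknife pseudo values for the survival probability at a fixed time point from a Kaplan-Meier estimate: for patient i, the pseudo value is n·S(t) − (n−1)·S₍₋ᵢ₎(t), where S₍₋ᵢ₎ is the estimate with patient i left out. This is a minimal illustration of the general technique, not the authors' code; the function names `km_survival` and `pseudo_values` are hypothetical, and the abstract does not say which software or exact variant was used. It covers only the fixed-time survival outcome, not the estimated survival time.

```python
from typing import List, Sequence


def km_survival(times: Sequence[float], events: Sequence[int], t: float) -> float:
    """Kaplan-Meier estimate of S(t) for right-censored data.

    events[i] is 1 if the patient died at times[i], 0 if censored there.
    """
    surv = 1.0
    # Only distinct death times up to t contribute a factor to the product.
    death_times = sorted({ti for ti, ei in zip(times, events) if ei and ti <= t})
    for u in death_times:
        at_risk = sum(1 for ti in times if ti >= u)
        deaths = sum(1 for ti, ei in zip(times, events) if ei and ti == u)
        surv *= 1.0 - deaths / at_risk
    return surv


def pseudo_values(times: Sequence[float], events: Sequence[int], t: float) -> List[float]:
    """Jackknife pseudo values for S(t): one number per patient, censored or not."""
    times, events = list(times), list(events)
    n = len(times)
    full = km_survival(times, events, t)
    result = []
    for i in range(n):
        # Leave-one-out estimate without patient i.
        loo = km_survival(times[:i] + times[i + 1:], events[:i] + events[i + 1:], t)
        result.append(n * full - (n - 1) * loo)
    return result


# Illustrative (made-up) follow-up times in months; second patient is censored.
example = pseudo_values([2, 4, 6, 8], [1, 0, 1, 1], t=5)
```

Because every patient receives a pseudo value regardless of censoring, the resulting vector can be passed as an ordinary outcome to techniques that have no built-in handling of censored data. Without censoring, the pseudo value reduces to the indicator of surviving beyond t.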

Results: Of a total of 1282 patients, 986 patients died during a median follow-up of 66 months (60-month survival: 52% [95% CI: 50%-55%]). The LR model had the highest optimism-corrected AUC (0.791) to predict 60-month survival, followed by the SVM model with a linear kernel (AUC 0.787). The GLM model had the smallest optimism-corrected RMSE when continuous pseudo values were considered for 60-month survival or the estimated survival time, followed by SVM models with a linear kernel. The estimated importance of predictors varied substantially by the specific aspect of survival studied and modeling technique used.
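The optimism-corrected AUC and RMSE figures can be read against a common bootstrap procedure: refit the model on each bootstrap resample, measure how much better it scores on that resample than on the original data, and subtract the average difference (the optimism) from the apparent performance. The sketch below illustrates that general scheme under stated assumptions. The abstract does not specify the exact procedure or number of resamples, and `auc`, `rmse`, `optimism_corrected`, and the toy model `fit_sign` are hypothetical names introduced only for this example.

```python
import random
from typing import Callable, List, Sequence

Model = Callable[[float], float]


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Area under the ROC curve as P(score of a 1 > score of a 0); ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def rmse(pred: Sequence[float], obs: Sequence[float]) -> float:
    """Root of the mean squared difference between predictions and observations."""
    return (sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(obs)) ** 0.5


def optimism_corrected(x: List[float], y: List[float],
                       fit: Callable[[List[float], List[float]], Model],
                       metric: Callable[[Sequence[float], Sequence[float]], float],
                       n_boot: int = 200, seed: int = 0) -> float:
    """Apparent performance minus the mean bootstrap optimism."""
    rng = random.Random(seed)
    n = len(y)
    model = fit(x, y)
    apparent = metric([model(v) for v in x], y)
    total, used = 0.0, 0
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]  # resample with replacement
        xb, yb = [x[i] for i in idx], [y[i] for i in idx]
        try:
            mb = fit(xb, yb)
            boot_perf = metric([mb(v) for v in xb], yb)  # on the resample
            orig_perf = metric([mb(v) for v in x], y)    # on the original data
        except ValueError:
            continue  # skip degenerate resamples, e.g. only one class present
        total += boot_perf - orig_perf
        used += 1
    if used == 0:
        raise ValueError("no usable bootstrap resamples")
    return apparent - total / used


def fit_sign(x: List[float], y: List[float]) -> Model:
    """Toy one-predictor stand-in for a real model: orient x toward the 1-class."""
    ones = [v for v, lab in zip(x, y) if lab == 1]
    zeros = [v for v, lab in zip(x, y) if lab == 0]
    if not ones or not zeros:
        raise ValueError("both classes are required to fit")
    sign = 1.0 if sum(ones) / len(ones) >= sum(zeros) / len(zeros) else -1.0
    return lambda v: sign * v


# Made-up, perfectly separable data, so apparent and corrected AUC coincide.
corrected = optimism_corrected([0, 1, 2, 3, 4, 5, 6, 7], [0, 0, 0, 0, 1, 1, 1, 1],
                               fit_sign, auc, n_boot=50, seed=1)
```

The same function applies to RMSE by passing `rmse` as the metric together with a model fitted to continuous pseudo values. Since lower RMSE is better, the optimism is typically negative there and the corrected RMSE is larger than the apparent one.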

Conclusions: The use of pseudo values makes it readily possible to apply alternative modeling techniques to survival problems, to compare their performance and to search further for promising alternative modeling techniques to analyze survival time.

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1. Survival pattern of 1282 patients with newly diagnosed HNSCC.
Figure 2. Censoring pattern of 1282 patients with newly diagnosed HNSCC.
Figure 3. Variable importance of the models per outcome.
