Randomized Controlled Trial

. 2005 Oct 5;9(5):R522-9.

doi: 10.1186/cc3790. Epub 2005 Aug 9.

Application of a population-based severity scoring system to individual patients results in frequent misclassification

Frank V Booth¹, Mary Short, Andrew F Shorr, Nancy Arkins, Becky Bates, Rebecca L Qualy, Howard Levy

Affiliations

PMID: 16277714
PMCID: PMC1297617
DOI: 10.1186/cc3790

Randomized Controlled Trial

Application of a population-based severity scoring system to individual patients results in frequent misclassification

Frank V Booth et al. Crit Care. 2005.

. 2005 Oct 5;9(5):R522-9.

doi: 10.1186/cc3790. Epub 2005 Aug 9.

Authors

Frank V Booth¹, Mary Short, Andrew F Shorr, Nancy Arkins, Becky Bates, Rebecca L Qualy, Howard Levy

Affiliation

¹ Eli Lilly and Company, Indianapolis, IN, USA. boothfv@lilly.com

PMID: 16277714
PMCID: PMC1297617
DOI: 10.1186/cc3790

Abstract

Introduction: APACHE II (AP2) was developed to allow a systematic examination of intensive care unit outcomes in a risk adjusted manner. AP2 has been widely adopted in clinical trials to assure broad consistency amongst different groups. Although errors in calculating the true AP2 score may not be reducible below 15%, the self-canceling effect of random errors reduces the importance of such errors when applied to large populations. It has been suggested that a threshold AP2 score be used in clinical decision making for individual patients. This study reports the AP2 scoring errors of researchers involved in a large sepsis trial and models the consequences of such an error rate for individual severe sepsis patients.

Methods: Fifty-six researchers with explicit training in data abstraction and completion of the AP2 score received scenarios consisting of composites of real patient histories. Descriptive statistics were calculated for each scenario. The standard deviations were calculated compared with an adjudicated score. Intraclass correlations for inter-observer reliability were performed using Shrout-Fleiss methodology. Theoretical distribution curves were calculated for a broad range of AP2 scores using standard deviations of 6, 9 and 12. For each curve, the misclassification rate was determined using an AP2 score cut-off of >or=25. The percentage of misclassifications for each true AP2 score was then applied to the corresponding AP2 score obtained from the PROGRESS severe sepsis registry.

Results: The error rate for the total AP2 score was 86% (individual variables were in the range 10% to 87%). Intraclass correlation for the inter-observer reliability was 0.51. Of the patients from the PROGRESS registry. 50% had AP2 scores in the range 17 to 28. Within this interquartile range, 70% to 85% of all misclassified patients would reside.

Conclusion: It is more likely that an individual patient will be scored incorrectly than correctly. The data obtained from the scenarios indicated that as the true AP2 score approached an arbitrary cut-off point of 25, the observed misclassification rate increased. Integrating our study of AP2 score errors with the published literature leads us to conclude that the AP2 is an inappropriate sole tool for resource allocation decisions for individual patients.

PubMed Disclaimer

Figures

**Figure 1**
Results of the scoring exercise. The results of the scoring exercise completed by researchers involved in a large randomized placebo-controlled critical care trial illustrating individual scores, standard deviations and inter-quartile ranges of case scenarios with adjudicated total APACHE II scores of 44, 22 and 19. ¹Correct classification is determined by the adjudicated score being either APACHE II ≥25 or APACHE II <25. ²Standard deviation is calculated using the adjudicated APACHE II score in place of the mean APACHE II score.

**Figure 2**
Theoretical distributions of APACHE II scores with varying SDs. A set of theoretical distributions of reported APACHE II scores based on standard deviations of 6 and 12 (which were what we observed in the case scenario data.) For the purposes of comparison, a set of curves using an intermediate standard deviation of 9 is also shown. In every curve, the shaded area illustrates the theoretical probability of misclassification based on a cut-off score of ≥25.

**Figure 3**
Distribution of reported APACHE II scores in the PROGRESS registry. The darker shading (outer envelope) of these plots represents the observed distribution of APACHE II scores of 5,253 severe sepsis patients in the PROGRESS registry. The lighter shading (inner envelope) is calculated by applying the probability of misclassification for each individual APACHE II score based on assumed standard deviation (SD) of (from top to bottom) 6, 9 and 12 and on an APACHE II cut-off score ≥25.

See this image and copyright information in PMC

References

1. Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med. 1985;13:818–829. - PubMed
1. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychological Bulletin. 1979;86:420–429. doi: 10.1037//0033-2909.86.2.420. - DOI - PubMed
1. Beale R, Reinhart K, Silva E, Dobb G, Sarwat S, Garg R, Vincent JL. Comparison of PROGRESS Severe Sepsis Registry patients to INDEPTH Integrated Severe Sepsis Clinical Trial Database placebo patients. Chest Meeting Abstracts; Chest. 2004;126:864S. Abstract #496.
1. Polderman KH, Thijs LG, Girbes AR. Interobserver variability in the use of APACHE II scores. Lancet. 1999;353:380. doi: 10.1016/S0140-6736(05)74953-9. - DOI - PubMed
1. Chen LM, Martin CM, Morrison TL, Sibbald WJ. Interobserver variability in data collection of the APACHE II score in teaching and community hospitals. Crit Care Med. 1999;27:1999–2004. doi: 10.1097/00003246-199909000-00046. - DOI - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- H1 Connect - Access expert opinions and insights on biomedical research.
Medical
- MedlinePlus Health Information
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Application of a population-based severity scoring system to individual patients results in frequent misclassification

Affiliation

Application of a population-based severity scoring system to individual patients results in frequent misclassification

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Research Materials