On the Dependence of the Critical Success Index (CSI) on Prevalence
- PMID: 38473017
- PMCID: PMC10931251
- DOI: 10.3390/diagnostics14050545
Abstract
The critical success index (CSI) is an established metric used in meteorology to verify the accuracy of weather forecasts. It is defined as the ratio of hits to the sum of hits, false alarms, and misses. Translationally, CSI has gained popularity as a unitary outcome measure in various clinical situations where large numbers of true negatives may influence the interpretation of other, more traditional, outcome measures, such as specificity (Spec) and negative predictive value (NPV), or when unified interpretation of positive predictive value (PPV) and sensitivity (Sens) is needed. The derivation of CSI from measures including PPV has prompted questions as to whether and how CSI values may vary with disease prevalence (P), just as PPV estimates are dependent on P, and hence whether CSI values are generalizable between studies with differing prevalences. As no detailed study of the relation of CSI to prevalence has been undertaken hitherto, the dataset of a previously published test accuracy study of a cognitive screening instrument was interrogated to address this question. Three different methods were used to examine the change in CSI across a range of prevalences, using both the Bayes formula and equations directly relating CSI to Sens, PPV, P, and the test threshold (Q). These approaches showed that, as expected, CSI does vary with prevalence, but the dependence differs according to the method of calculation that is adopted. Bayesian rescaling of both Sens and PPV generates a concave curve, suggesting that CSI will be maximal at a particular prevalence, which may vary according to the particular dataset.
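The definition given above can be written as CSI = TP / (TP + FP + FN), which is algebraically equivalent to CSI = 1 / (1/PPV + 1/Sens - 1). The following is a minimal sketch, not the authors' procedure, of one simple way to see a prevalence dependence: Sens and Spec are held fixed, PPV is recomputed at each prevalence with the Bayes formula, and CSI is then derived from PPV and Sens. The function names and the numerical values (Sens = 0.8, Spec = 0.9) are hypothetical and chosen only for illustration; they are not taken from the study dataset.

```python
def csi_from_counts(tp: float, fp: float, fn: float) -> float:
    """CSI as hits / (hits + false alarms + misses)."""
    return tp / (tp + fp + fn)


def csi_from_ppv_sens(ppv: float, sens: float) -> float:
    """Equivalent form: CSI = 1 / (1/PPV + 1/Sens - 1)."""
    return 1.0 / (1.0 / ppv + 1.0 / sens - 1.0)


def ppv_bayes(sens: float, spec: float, prevalence: float) -> float:
    """PPV at a given prevalence via the Bayes formula (Sens, Spec fixed)."""
    true_pos = sens * prevalence
    false_pos = (1.0 - spec) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


def csi_at_prevalence(sens: float, spec: float, prevalence: float) -> float:
    """CSI at a given prevalence, assuming Sens and Spec do not change."""
    return csi_from_ppv_sens(ppv_bayes(sens, spec, prevalence), sens)


if __name__ == "__main__":
    # Hypothetical test characteristics, for illustration only.
    sens, spec = 0.8, 0.9
    for p in (0.1, 0.3, 0.5, 0.7, 0.9):
        print(f"P = {p:.1f}  CSI = {csi_at_prevalence(sens, spec, p):.3f}")
```

Under this simplifying assumption (only PPV is rescaled), CSI rises steadily with prevalence. It does not reproduce the concave curve described in the abstract, which arises when both Sens and PPV are rescaled.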
Keywords: Bayes formula; F measure; binary classification; critical success index; prevalence.
Conflict of interest statement
The authors declare no conflicts of interest.