BMJ. 2016 Jun 22;353:i3140.

doi: 10.1136/bmj.i3140.

External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges

Richard D Riley¹, Joie Ensor², Kym I E Snell³, Thomas P A Debray⁴, Doug G Altman⁵, Karel G M Moons⁴, Gary S Collins⁵

Affiliations

¹ Research Institute for Primary Care and Health Sciences, Keele University, Keele ST5 5BG, Staffordshire, UK r.riley@keele.ac.uk.
² Research Institute for Primary Care and Health Sciences, Keele University, Keele ST5 5BG, Staffordshire, UK.
³ Institute of Applied Health Research, University of Birmingham, Edgbaston, Birmingham, UK.
⁴ Julius Centre for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, Netherlands; Cochrane Netherlands, University Medical Center Utrecht, Utrecht, Netherlands.
⁵ Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK.

PMID: 27334381
PMCID: PMC4916924
DOI: 10.1136/bmj.i3140

Richard D Riley et al. BMJ. 2016.

Erratum in

External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges.
[No authors listed]. BMJ. 2019 Jun 25;365:l4379. doi: 10.1136/bmj.l4379. PMID: 31239248. Free PMC article. No abstract available.

Abstract

Access to big datasets from e-health records and individual participant data (IPD) meta-analysis is opening a new era of external validation studies for clinical prediction models. In this article, the authors illustrate novel opportunities for external validation in big, combined datasets, while drawing attention to methodological challenges and reporting issues.

Conflict of interest statement

Competing interests: None declared.

Figures

**Fig 1**
Format of typical prediction models seen in the medical literature

**Fig 2**
Calibration performance (as measured by the E/O statistic) of a diagnostic prediction model for deep vein thrombosis, over all studies combined and in each of the 12 studies separately. E=total number expected to have deep vein thrombosis according to the prediction model; O=total number observed with deep vein thrombosis; I²=proportion (%) of variability in the ln(E/O) estimates in the meta-analysis that is due to between-study variation (genuine differences between studies in the true ln(E/O)), rather than within-study sampling error (chance)

**Fig 3**
Funnel plots of discrimination performance (as measured by the C statistic) of QRISK2, across all 364 general practice surgeries in the external validation dataset of Collins and Altman. Plots show C statistic versus (a) number of cardiovascular events and (b) standard error of logit C statistic

**Fig 4**
Calibration of QRISK2 and the Framingham risk score in women aged 35 to 74 years, (a) by tenth of predicted risk augmented with a smoothed calibration curve, and (b) within eight age groups. Dotted lines denote perfect calibration

**Fig 5**
Association between percentage of smokers and C statistic for QRISK2 across all 364 general practice surgeries in the external validation dataset of Collins and Altman. Circle size is weighted by the precision of the C statistic estimate (that is, larger circles indicate C statistic estimates with smaller standard errors, and thus more weight in the meta-regression). Note: the solid line shows the meta-regression slope when data are analysed on the C statistic scale; similar findings and trends were obtained when reanalysing on the logit C statistic scale

**Fig 6**
Calibration performance (as measured by the calibration slope) of the breast cancer model evaluated by Snell and colleagues before and after recalibration of the baseline mortality rate in each country. (a) Forest plot assuming the same baseline hazard rate in each country (no recalibration). (b) Forest plot allowing a different baseline hazard rate for each country (recalibration)
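
The Fig 2 caption defines the E/O statistic and I² in words. As a rough illustration only (not the authors' analysis), the sketch below computes ln(E/O) for each study, pools the estimates with inverse-variance weights, and derives I² from Cochran's Q. Several things here are assumptions rather than anything stated in this record: the function name `eo_summary`, the made-up study counts, the approximate variance (1 - O/N)/O for ln(E/O) (which treats E as fixed), and the use of simple fixed-effect weights for pooling.

```python
import math

def eo_summary(studies):
    """Summarise calibration across studies.

    studies: list of (expected, observed, n) tuples, one per study.
    Returns the pooled E/O, Cochran's Q, and I-squared (%).
    """
    log_eo, weights = [], []
    for expected, observed, n in studies:
        log_eo.append(math.log(expected / observed))
        # Approximate variance of ln(E/O), treating E as fixed (assumption).
        var = (1 - observed / n) / observed
        weights.append(1 / var)

    # Inverse-variance (fixed-effect) pooled estimate on the log scale.
    pooled = sum(w * y for w, y in zip(weights, log_eo)) / sum(weights)

    # Cochran's Q: weighted squared deviations from the pooled estimate.
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, log_eo))
    df = len(studies) - 1

    # I-squared: share of variability beyond what chance alone would give.
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    return {"pooled_eo": math.exp(pooled), "q": q, "i2": i2}

# Hypothetical counts (expected, observed, sample size); not data from the article.
example = [(48.0, 50, 400), (30.0, 24, 300), (75.0, 90, 600)]
result = eo_summary(example)
print(result)
```

An E/O close to 1 suggests that the total number of predicted events roughly matches the number observed. A random-effects model would usually be preferred when I² is large, since the fixed-effect pooling used here ignores between-study variation.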

References

1. Steyerberg EW. Clinical prediction models: a practical approach to development, validation, and updating. Springer, 2009. doi:10.1007/978-0-387-77244-8.
2. Royston P, Moons KGM, Altman DG, Vergouwe Y. Prognosis and prognostic research: developing a prognostic model. BMJ 2009;338:b604. doi:10.1136/bmj.b604. pmid:19336487.
3. Steyerberg EW, Moons KG, van der Windt DA, et al. PROGRESS Group. Prognosis Research Strategy (PROGRESS) 3: prognostic model research. PLoS Med 2013;10:e1001381. doi:10.1371/journal.pmed.1001381. pmid:23393430.
4. Anderson KM, Odell PM, Wilson PW, Kannel WB. Cardiovascular disease risk profiles. Am Heart J 1991;121:293-8. doi:10.1016/0002-8703(91)90861-B. pmid:1985385.
5. Hippisley-Cox J, Coupland C, Vinogradova Y, et al. Predicting cardiovascular risk in England and Wales: prospective derivation and validation of QRISK2. BMJ 2008;336:1475-82. doi:10.1136/bmj.39609.449676.25. pmid:18573856.
