Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 26;122(34):e2424203122.
doi: 10.1073/pnas.2424203122. Epub 2025 Aug 20.

Minimizing and quantifying uncertainty in AI-informed decisions: Applications in medicine

Affiliations

Minimizing and quantifying uncertainty in AI-informed decisions: Applications in medicine

Samuel D Curtis et al. Proc Natl Acad Sci U S A. .

Abstract

AI is now a cornerstone of modern dataset analysis. In many real world applications, practitioners are concerned with controlling specific kinds of errors, rather than minimizing the overall number of errors. For example, biomedical screening assays may primarily be concerned with mitigating the number of false positives rather than false negatives. Quantifying uncertainty in AI-based predictions, and in particular those controlling specific kinds of errors, remains theoretically and practically challenging. We develop a strategy called multidimensional informed generalized hypothesis testing (MIGHT) which we prove accurately quantifies uncertainty and confidence given sufficient data, and concomitantly controls for particular error types. Our key insight was that it is possible to integrate canonical cross-validation and parametric calibration procedures within a nonparametric ensemble method. Simulations demonstrate that while typical AI based-approaches cannot be trusted to obtain the truth, MIGHT can be. We apply MIGHT to answer an open question in liquid biopsies using circulating cell-free DNA (ccfDNA) in individuals with or without cancer: Which biomarkers, or combinations thereof, can we trust? Performance estimates produced by MIGHT on ccfDNA data have coefficients of variation that are often orders of magnitude lower than other state of the art algorithms such as support vector machines, random forests, and Transformers, while often also achieving higher sensitivity. We find that combinations of variable sets often decrease rather than increase sensitivity over the optimal single variable set because some variable sets add more noise than signal. This work demonstrates the importance of quantifying uncertainty and confidence-with theoretical guarantees-for the interpretation of real-world data.

Keywords: biomarkers; biomedical assays; cancer screening; hypothesis testing; predictive modeling.

PubMed Disclaimer

Conflict of interest statement

Competing interests statement:J.T. has a speaking, advisory, and consultancy with Haystack Oncology, Amgen, Novartis, AstraZeneca, Merck Serono, Merck Sharp & Dohme, Beigene, Pierre Fabre, Bristol Myers Squibb, Gilead, and Daiichi Sankyo. K.W.K., N.P., and B.V. are founders of Thrive Earlier Detection, an Exact Sciences Company. K.W.K., N.P., and C.D. are consultants to Thrive Earlier Detection. K.W.K. and N.P. are consultants to Neophore. K.W.K., N.P., B.V., and C.D. hold equity in Exact Sciences. K.W.K., N.P., and B.V. are founders of and own equity in Haystack Oncology & ManaT Bio. B.V. is a consultant to and holds equity in Catalio Capital Management. C.B. is a co-founder of OrisDx. C.B. and C.D. are co-founders of Belay Diagnostics. K.W.K., N.P., and B.V. hold equity in and are consultants to CAGE Pharma. The companies named above, as well as other companies, have licensed previously described technologies related to the work described in this paper from Johns Hopkins University. C.B., K.W.K., N.P., and B.V., and C.D. are inventors on some of these technologies. Licenses to these technologies are or will be associated with equity or royalty payments to the inventors as well as to Johns Hopkins University. Patent applications on the work described in this paper may be filed by Johns Hopkins University. The terms of all these arrangements are being managed by Johns Hopkins University in accordance with its conflict of interest policies. One of the reviewers of this study is at McGill alongside L.G., but there was no collaboration or association between them.

References

    1. Hardt M., Recht B., Patterns, Predictions, and Actions: Foundations of Machine Learning (Princeton University Press, 2022).
    1. Yu B., Kumbier K., Veridical data science. Proc. Natl. Acad. Sci. U.S.A. 117, 3920–3929 (2020). - PMC - PubMed
    1. Ghasem S. M., et al. , A novel method to guide biomarker combinations to optimize the sensitivity. bioRxiv [Preprint] (2024). https://www.biorxiv.org/content/10.1101/2024.04.12.589302v1 (Accessed 15 April 2024). - DOI
    1. Dodd L. E., Pepe M. S., Partial AUC estimation and regression. Biometrics 59, 614–623 (2003). - PubMed
    1. Devroye L., Gyorfi L., Krzyzak A., Lugosi G., On the strong universal consistency of nearest neighbor regression function estimates. Ann. Stat. 22, 1371–1385 (1994).