Reporting guideline for chatbot health advice studies: the Chatbot Assessment Reporting Tool (CHART) statement

CHART Collaborative

Collaborators

CHART Collaborative:
Bright Huo, Gary Collins, David Chartash, Arun Thirunavukarasu, Annette Flanagin, Alfonso Iorio, Giovanni Cacciamani, Xi Chen, Nan Liu, Piyush Mathur, An Wen Chan, Christine Laine, Daniela Pacella, Michael Berkwits, Stavros A Antoniou, Jennifer C Camaradou, Carolyn Canfield, Michael Mittelman, Timothy Feeney, Elizabeth Loder, Riaz Agha, Ashirbani Saha, Julio Mayol, Anthony Sunjaya, Hugh Harvey, Jeremy Y Ng, Tyler McKechnie, Yung Lee, Nipun Verma, Gregor Stiglic, Melissa McCradden, Karim Ramji, Vanessa Boudreau, Monica Ortenzi, Joerg Meerpohl, Per Olav Vandvik, Thomas Agoritsas, Diana Samuel, Helen Frankish, Michael Anderson, Xiaomei Yao, Stacy Loeb, Cynthia Lokker, Xiaoxuan Liu, Eliseo Guallar, Gordon Guyatt

PMID: 40747825
PMCID: PMC12314741
DOI: 10.1093/bjs/znaf142

Reporting guideline for chatbot health advice studies: the Chatbot Assessment Reporting Tool (CHART) statement

CHART Collaborative. Br J Surg. 2025.

. 2025 Aug 1;112(8):znaf142.

doi: 10.1093/bjs/znaf142.

Author

CHART Collaborative

Collaborators

CHART Collaborative:
Bright Huo, Gary Collins, David Chartash, Arun Thirunavukarasu, Annette Flanagin, Alfonso Iorio, Giovanni Cacciamani, Xi Chen, Nan Liu, Piyush Mathur, An Wen Chan, Christine Laine, Daniela Pacella, Michael Berkwits, Stavros A Antoniou, Jennifer C Camaradou, Carolyn Canfield, Michael Mittelman, Timothy Feeney, Elizabeth Loder, Riaz Agha, Ashirbani Saha, Julio Mayol, Anthony Sunjaya, Hugh Harvey, Jeremy Y Ng, Tyler McKechnie, Yung Lee, Nipun Verma, Gregor Stiglic, Melissa McCradden, Karim Ramji, Vanessa Boudreau, Monica Ortenzi, Joerg Meerpohl, Per Olav Vandvik, Thomas Agoritsas, Diana Samuel, Helen Frankish, Michael Anderson, Xiaomei Yao, Stacy Loeb, Cynthia Lokker, Xiaoxuan Liu, Eliseo Guallar, Gordon Guyatt

PMID: 40747825
PMCID: PMC12314741
DOI: 10.1093/bjs/znaf142

Abstract

The Chatbot Assessment Reporting Tool (CHART) is a reporting guideline developed to provide reporting recommendations for studies evaluating the performance of generative artificial intelligence (AI)-driven chatbots when summarizing clinical evidence and providing health advice, referred to as chatbot health advice studies. CHART was developed in several phases after performing a comprehensive systematic review to identify variation in the conduct, reporting, and method in chatbot health advice studies. Findings from the review were used to develop a draft checklist that was revised through an international, multidisciplinary, modified, asynchronous Delphi consensus process of 531 stakeholders, three synchronous panel consensus meetings of 48 stakeholders, and subsequent pilot testing of the checklist. CHART includes 12 items and 39 subitems to promote transparent and comprehensive reporting of chatbot health advice studies. These include title (subitem 1a), abstract/summary (subitem 1b), background (subitems 2a,b), model identifiers (subitems 3a,b), model details (subitems 4a-c), prompt engineering (subitems 5a,b), query strategy (subitems 6a-d), performance evaluation (subitems 7a,b), sample size (subitem 8), data analysis subitem 9a), results (subitems 10a-c), discussion (subitems 11a-c), disclosures (subitem 12a), funding (subitem 12b), ethics (subitem 12c), protocol (subitem 12d), and data availability (subitem 12e). The CHART checklist and corresponding diagram of the method were designed to support key stakeholders including clinicians, researchers, editors, peer reviewers, and readers in reporting, understanding, and interpreting the findings of chatbot health advice studies.

© The Author(s) 2025. Published by Oxford University Press on behalf of BJS Foundation Ltd, by Elsevier BV, by Annals of Family Medicine, Inc., by Springer Nature, by BMJ Publishing Group Limited, and by American Medical Association.

PubMed Disclaimer

Figures

**Fig. 1**
CHART methodological diagram AI, artificial intelligence; API, application programming interfaces.

See this image and copyright information in PMC

References

1. Kolbinger FR, Veldhuizen GP, Zhu J, Truhn D, Kather JN. Reporting guidelines in medical artificial intelligence: a systematic review and meta-analysis. Commun Med (Lond) 2024;4:71. - PMC - PubMed
1. Han R, Acosta JN, Shakeri Z, Ioannidis JPA, Topol EJ, Rajpurkar P. Randomised controlled trials evaluating artificial intelligence in clinical practice: a scoping review. Lancet Digit Health 2024;6:e367–e373 - PMC - PubMed
1. Huo B, Cacciamani GE, Collins GS, McKechnie T, Lee Y, Guyatt G. Reporting standards for the use of large language model-linked chatbots for health advice. Nat Med 2023;29:2988–2988 - PubMed
1. Huo B, McKechnie T, Ortenzi M, Lee Y, Antoniou S, Mayol J et al. Dr. GPT will see you now: the ability of large language model-linked chatbots to provide colorectal cancer screening recommendations. Health Technol (Berl) 2024;14:463–469
1. Huo B, Marfo N, Sylla P, Calabrese E, Kumar S, Slater BJ et al. Clinical artificial intelligence: teaching a large language model to generate recommendations that align with guidelines for the surgical management of GERD. Surg Endosc 2024;38:5668–5677 - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

McMaster University

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Reporting guideline for chatbot health advice studies: the Chatbot Assessment Reporting Tool (CHART) statement

Collaborators

Reporting guideline for chatbot health advice studies: the Chatbot Assessment Reporting Tool (CHART) statement

Author

Collaborators

Abstract

Figures

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources