. 2024 Mar 19;150(3):140.

doi: 10.1007/s00432-024-05678-6.

Utilizing large language models in breast cancer management: systematic review

Vera Sorin^{1

2}, Benjamin S Glicksberg³, Yaara Artsi⁴, Yiftach Barash^{5

6}, Eli Konen⁵, Girish N Nadkarni^{3

7}, Eyal Klang^{3

7}

Affiliations

¹ Department of Diagnostic Imaging, Chaim Sheba Medical Center, Affiliated to the Sackler School of Medicine, Tel-Aviv University, Emek Haela St. 1, 52621, Ramat Gan, Israel. verasrn@gmail.com.
² DeepVision Lab, Chaim Sheba Medical Center, Tel Hashomer, Israel. verasrn@gmail.com.
³ Division of Data-Driven and Digital Medicine (D3M), Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁴ Azrieli Faculty of Medicine, Bar-Ilan University, Zefat, Israel.
⁵ Department of Diagnostic Imaging, Chaim Sheba Medical Center, Affiliated to the Sackler School of Medicine, Tel-Aviv University, Emek Haela St. 1, 52621, Ramat Gan, Israel.
⁶ DeepVision Lab, Chaim Sheba Medical Center, Tel Hashomer, Israel.
⁷ The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.

PMID: 38504034
PMCID: PMC10950983
DOI: 10.1007/s00432-024-05678-6

Utilizing large language models in breast cancer management: systematic review

Vera Sorin et al. J Cancer Res Clin Oncol. 2024.

. 2024 Mar 19;150(3):140.

doi: 10.1007/s00432-024-05678-6.

Authors

Vera Sorin^{1

2}, Benjamin S Glicksberg³, Yaara Artsi⁴, Yiftach Barash^{5

6}, Eli Konen⁵, Girish N Nadkarni^{3

7}, Eyal Klang^{3

7}

Affiliations

¹ Department of Diagnostic Imaging, Chaim Sheba Medical Center, Affiliated to the Sackler School of Medicine, Tel-Aviv University, Emek Haela St. 1, 52621, Ramat Gan, Israel. verasrn@gmail.com.
² DeepVision Lab, Chaim Sheba Medical Center, Tel Hashomer, Israel. verasrn@gmail.com.
³ Division of Data-Driven and Digital Medicine (D3M), Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁴ Azrieli Faculty of Medicine, Bar-Ilan University, Zefat, Israel.
⁵ Department of Diagnostic Imaging, Chaim Sheba Medical Center, Affiliated to the Sackler School of Medicine, Tel-Aviv University, Emek Haela St. 1, 52621, Ramat Gan, Israel.
⁶ DeepVision Lab, Chaim Sheba Medical Center, Tel Hashomer, Israel.
⁷ The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.

PMID: 38504034
PMCID: PMC10950983
DOI: 10.1007/s00432-024-05678-6

Abstract

Purpose: Despite advanced technologies in breast cancer management, challenges remain in efficiently interpreting vast clinical data for patient-specific insights. We reviewed the literature on how large language models (LLMs) such as ChatGPT might offer solutions in this field.

Methods: We searched MEDLINE for relevant studies published before December 22, 2023. Keywords included: "large language models", "LLM", "GPT", "ChatGPT", "OpenAI", and "breast". The risk bias was evaluated using the QUADAS-2 tool.

Results: Six studies evaluating either ChatGPT-3.5 or GPT-4, met our inclusion criteria. They explored clinical notes analysis, guideline-based question-answering, and patient management recommendations. Accuracy varied between studies, ranging from 50 to 98%. Higher accuracy was seen in structured tasks like information retrieval. Half of the studies used real patient data, adding practical clinical value. Challenges included inconsistent accuracy, dependency on the way questions are posed (prompt-dependency), and in some cases, missing critical clinical information.

Conclusion: LLMs hold potential in breast cancer care, especially in textual information extraction and guideline-driven clinical question-answering. Yet, their inconsistent accuracy underscores the need for careful validation of these models, and the importance of ongoing supervision.

Keywords: Artificial intelligence; Breast cancer; GPT; Large language models.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

The authors declare that they have no conflict of interest.

Figures

**Fig. 1**
Flow Diagram of the Inclusion Process. Flow diagram of the search and inclusion process based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines

**Fig. 2**
Applications of large language models in breast cancer care and the corresponding accuracies achieved in various tasks in the different studies

See this image and copyright information in PMC

References

1. Brin D, Sorin V, Konen E, Nadkarni G, Glicksberg BS, Klang E (2023) How large language models perform on the united states medical licensing examination: a systematic review. medRxiv 23:543
1. Bubeck S, Chandrasekaran V, Eldan R, et al. (2023) Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712
1. Chaudhry HJ, Katsufrakis PJ, Tallia AF (2020) The USMLE step 1 decision. JAMA 323(20):2017 - PubMed
1. Choi HS, Song JY, Shin KH, Chang JH, Jang B-S (2023) Developing prompts from large language model for extracting clinical information from pathology and ultrasound reports in breast cancer. Radiat Oncol J 41(3):209–216 - PMC - PubMed
1. Decker H, Trang K, Ramirez J et al (2023) Large language Model−based Chatbot vs Surgeon-generated informed consent documentation for common procedures. JAMA Netw Open 6(10):e2336997 - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Utilizing large language models in breast cancer management: systematic review

Affiliations

Utilizing large language models in breast cancer management: systematic review

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Miscellaneous