Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Sep 6:46:101064.
doi: 10.1016/j.lanepe.2024.101064. eCollection 2024 Nov.

Artificial intelligence in oncology: ensuring safe and effective integration of language models in clinical practice

Affiliations
Review

Artificial intelligence in oncology: ensuring safe and effective integration of language models in clinical practice

Loïc Verlingue et al. Lancet Reg Health Eur. .

Abstract

In this Personal View, we address the latest advancements in automatic text analysis with artificial intelligence (AI) in medicine, with a focus on its implications in aiding treatment decisions in medical oncology. Acknowledging that a majority of hospital medical content is embedded in narrative format, natural language processing has become one of the most dynamic research fields for developing clinical decision support tools. In addition, large language models have recently reached unprecedented performance, notably when answering medical questions. Emerging applications include prognosis estimation, treatment recommendations, multidisciplinary tumor board recommendations and matching patients to recruiting clinical trials. Altogether, we advocate for a forward-looking approach in which the community efficiently initiates global prospective clinical evaluations of promising AI-based decision support systems. Such assessments will be essential to validate and evaluate potential biases, ensuring these innovations can be effectively and safely translated into practical tools for oncological practice. We are at a pivotal moment, where continued advancements in patient care must be pursued with scientific rigor.

Keywords: Artificial intelligence; Clinical trials; Healthcare; Language models; Natural language processing; Oncology.

PubMed Disclaimer

Conflict of interest statement

LV reports personal fees from Adaptherapy, is CEO of RESOLVED, has received non-personal fees from Pierre-Fabre and Servier, and a grant from Bristol-Myers Squibb, all outside the submitted work. As part of the Phase 1 unit of Centre Léon Bérard, as medical doctor, LV report being: Principal/sub-Investigator of Clinical Trials for Abbvie, Adaptimmune, Aduro Biotech, Agios Pharmaceuticals, Amgen, Argen-X Bvba, Arno Therapeutics, Astex Pharmaceuticals, Astra Zeneca Ab, Aveo, Basilea Pharmaceutica International Ltd, Bayer Healthcare Ag, Bbb Technologies Bv, Beigene, Blueprint Medicines, Boehringer Ingelheim, Boston Pharmaceuticals, Bristol Myers Squibb, Ca, Celgene Corporation, Chugai Pharmaceutical Co, Clovis Oncology, Cullinan-Apollo, Daiichi Sankyo, Debiopharm, Eisai, Eisai Limited, Eli Lilly, Exelixis, Faron Pharmaceuticals Ltd, Forma Tharapeutics, Gamamabs, Genentech, Glaxosmithkline, H3 Biomedicine, Hoffmann La Roche Ag, Imcheck Therapeutics, Innate Pharma, Institut De Recherche Pierre Fabre, Iris Servier, Janssen Cilag, Janssen Research Foundation, Kura Oncology, Kyowa Kirin Pharm. Dev, Lilly France, Loxo Oncology, Lytix Biopharma As, Medimmune, Menarini Ricerche, Merck Sharp & Dohme Chibret, Merrimack Pharmaceuticals, Merus, Millennium Pharmaceuticals, Molecular Partners Ag, Nanobiotix, Nektar Therapeutics, Novartis Pharma, Octimet Oncology Nv, Oncoethix, Oncopeptides, Orion Pharma, Ose Pharma, Pfizer, Pharma Mar, Pierre Fabre, Medicament, Roche, Sanofi Aventis, Seattle Genetics, Sotio A.S, Syros Pharmaceuticals, Taiho Pharma, Tesaro, Xencor. Research Grants from Astrazeneca, BMS, Boehringer Ingelheim, Janssen Cilag, Merck, Novartis, Onxeo, Pfizer, Roche, Sanofi. Non-financial support (drug supplied) from Astrazeneca, Bayer, BMS, Boringher Ingelheim, Medimmune, Merck, NH TherAGuiX, Onxeo, Pfizer. ED reports grants and personal fees from Roche Genentech, grants from Boehringer, grants from Astrazeneca, grants and personal fees from Merck Serono, grants from BMS, and grants from MSD Roche. JYB declares Grant from Roche for research program Profiler 2, Grant from the French NCI for LYRICAN+, Grant from MSD Avenir for an AI project, grant from ANR for the PORTRAIT program. The other authors declare no potential conflicts of interest.

Figures

Fig. 1
Fig. 1
Repartition of the number of machine learning methods applied to oncology-related tasks (left panel), as per PubMed query (assessed in February 2024). The right panel specifies the repartition of methods utilizing deep learning. MeSH terms that were used for each query are specified in bold with the related fraction. The color legend indicates the absolute count of results per query.
Fig. 2
Fig. 2
Trial Matching example by prompting the Large Language Model GPT4-turbo from https://chat.lmsys.org/. The medical report and synthetic patient profile was extracted from SemEval 2024, and selection criteria of the trial extracted from clinicaltrials.gov.
Fig. 3
Fig. 3
Examples of AI-stratified prospective clinical trial designs with randomisation. Scenario#1: randomization to assign patients to use or not the predictions of the tool, inspired from. This applies to situations such as patients monitoring (e.g. using automatic altert systems versus standard procedures), orientation to a specific interventions or another (e.g. orientation to targeted therapies based on the presence of a biomarker), or evaluate a new prognosis tool to decide for an intervention or not (e.g. a surgery, a new treatment line, a clinical trial). This designs alows to derive clear conclusion of the superiority of the prediction compared to the standard of care. Scenario #2: randomization obtain the information now or in a pre-defined periode of time, i.e. intra-patient comparison, also known as N-of-1 trials. In practice, when AI models are used as search engines for medical treatments or clinical trials, it seems unethical to spare patients from the information. It can apply to several clinical decision support systems, to the evaluation of the efficacy of a treatement (using overall reponse rate as primary endpoint), to the impact on the work load of careguivers, the quality of life of patients, among other criterias. Patients that are randomized in the standard of care arm, can have acess to the results of the system after an observation period, here 3 months. The comparison of the endpoint measures between the two period of time allows concluding on the impact of the intervention.

References

    1. Luchini C., Pea A., Scarpa A. Artificial intelligence in oncology: current applications and future perspectives. Br J Cancer. 2022;126(1):4–9. - PMC - PubMed
    1. Devlin J., Chang M.W., Lee K., Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. arXiv. 2019 http://arxiv.org/abs/1810.04805 [cité 5 oct 2020]; Disponible sur:
    1. He K., Mao R., Lin Q., et al. A survey of large language models for healthcare: from data, technology, and applications to accountability and ethics. arXiv. 2023 http://arxiv.org/abs/2310.05694 [cité 18 mars 2024]. Disponible sur:
    1. Keraghel I., Morbieu S., Nadif M. A survey on recent advances in named entity recognition. arXiv. 2024 http://arxiv.org/abs/2401.10825 [cité 22 mars 2024]. Disponible sur:
    1. Chen S., Li Y., Lu S., et al. Evaluation of ChatGPT family of models for biomedical reasoning and classification. arXiv. 2023 http://arxiv.org/abs/2304.02496 [cité 18 mars 2024]. Disponible sur: - PMC - PubMed

LinkOut - more resources