Mapping the Advanced-Stage Epithelial Ovarian Cancer Landscape Goes Beyond Words: Two Large Language Models, Eight Tasks, One Journey
- PMID: 40217674
- PMCID: PMC11989528
- DOI: 10.3390/jcm14072223
Abstract
Background/Objectives: Advances in natural language processing (NLP) have transformed many sectors, but the application of NLP in healthcare, particularly for analysing clinical notes, remains underdeveloped. We investigated whether deep neural networks, specifically transformer-based models, can predict intraoperative and post-operative outcomes of cytoreductive surgery for advanced-stage epithelial ovarian cancer (aEOC) from unstructured surgical notes.
Methods: We evaluated RoBERTa, a general-purpose language model, and GatorTron, a domain-specific model, on eight binary classification tasks using the same dataset. The dataset consisted of 560 surgical records from patients with aEOC who underwent cytoreductive surgery at a tertiary UK reference centre. The outcomes to be predicted were converted into binary variables so that each could be framed as a classification task. To give the models more context, the free text from the "operative findings" and "operative notes" fields was concatenated.
Results: GatorTron outperformed RoBERTa on most predictive tasks, underscoring the advantage of domain-specific pretraining for understanding medical terminology and context. Both models struggled to predict certain outcomes, particularly post-operative events such as major complications and length of hospital stay, despite adjustments to hyperparameters and training strategies. This limitation suggests that operative text alone may not sufficiently capture the complexities of post-operative recovery.
Conclusions: These findings show the tangible benefits of domain-specific language models for clinical text analysis and have valuable implications for developing medical AI systems to improve the delivery of modern aEOC healthcare.
Keywords: GatorTron; RoBERTa; epithelial ovarian cancer; natural language processing; operative notes; transfer learning.
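The data preparation described in the Methods (concatenating the two free-text fields and converting an outcome into a binary label) can be sketched as follows. This is a minimal illustration, not the authors' code: the `SurgicalRecord` class, its field names, the 7-day cut-off, and the two toy records are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class SurgicalRecord:
    # Hypothetical field names; the paper only names the two text sections.
    operative_findings: str
    operative_notes: str
    length_of_stay_days: float  # example outcome, assumed for illustration


def build_model_input(record: SurgicalRecord, sep: str = " ") -> str:
    """Concatenate the two free-text fields, skipping any that are empty."""
    parts = [record.operative_findings.strip(), record.operative_notes.strip()]
    return sep.join(p for p in parts if p)


def binarize(value: float, threshold: float) -> int:
    """Return 1 if the outcome exceeds the threshold, otherwise 0."""
    return int(value > threshold)


# Invented toy records, not real patient data.
records = [
    SurgicalRecord("Omental cake, miliary disease.", "Total omentectomy performed.", 5),
    SurgicalRecord("", "Bowel resection with anastomosis.", 12),
]

LOS_THRESHOLD_DAYS = 7  # illustrative cut-off, not the one used in the study

texts = [build_model_input(r) for r in records]
labels = [binarize(r.length_of_stay_days, LOS_THRESHOLD_DAYS) for r in records]
```

Each of the eight tasks would get its own label list built this way, while the concatenated `texts` stay the same across tasks; the resulting text and label pairs are what a classifier such as RoBERTa or GatorTron would then be fine-tuned on.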
Conflict of interest statement
The authors declare no conflicts of interest.