. 2024 Oct 3:26:e60601.

doi: 10.2196/60601.

Ascle-A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study

Rui Yang^#¹, Qingcheng Zeng^#², Keen You^#³, Yujie Qiao^#⁴, Lucas Huang³, Chia-Chun Hsieh³, Benjamin Rosand³, Jeremy Goldwasser³, Amisha Dave⁵, Tiarnan Keenan⁶, Yuhe Ke⁷, Chuan Hong⁸, Nan Liu^{1

9

10}, Emily Chew⁶, Dragomir Radev³, Zhiyong Lu¹¹, Hua Xu¹², Qingyu Chen¹², Irene Li^{13

14}

Affiliations

¹ Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore.
² Department of Linguistics, Northwestern University, Evanston, IL, United States.
³ Department of Computer Science, Yale University, New Haven, CT, United States.
⁴ Yale School of Public Health, Yale University, New Haven, CT, United States.
⁵ Yale New Haven Hospital, Yale School of Medicine, Yale University, New Haven, CT, United States.
⁶ Division of Epidemiology and Clinical Applications, National Eye Institute, National Institutes of Health, Bethesda, MD, United States.
⁷ Department of Anesthesiology, Singapore General Hospital, Singapore, Singapore.
⁸ Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, United States.
⁹ Program in Health Services and Systems Research, Duke-NUS Medical School, Singapore, Singapore.
¹⁰ Institute of Data Science, National University of Singapore, Singapore, Singapore.
¹¹ National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States.
¹² Department of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, United States.
¹³ Information Technology Center, University of Tokyo, Kashiwa, Japan.
¹⁴ Smartor LLC, Tokyo, Japan.

^# Contributed equally.

PMID: 39361955
PMCID: PMC11487205
DOI: 10.2196/60601

Ascle-A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study

Rui Yang et al. J Med Internet Res. 2024.

. 2024 Oct 3:26:e60601.

doi: 10.2196/60601.

Authors

Affiliations

¹ Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore.
² Department of Linguistics, Northwestern University, Evanston, IL, United States.
³ Department of Computer Science, Yale University, New Haven, CT, United States.
⁴ Yale School of Public Health, Yale University, New Haven, CT, United States.
⁵ Yale New Haven Hospital, Yale School of Medicine, Yale University, New Haven, CT, United States.
⁶ Division of Epidemiology and Clinical Applications, National Eye Institute, National Institutes of Health, Bethesda, MD, United States.
⁷ Department of Anesthesiology, Singapore General Hospital, Singapore, Singapore.
⁸ Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, United States.
⁹ Program in Health Services and Systems Research, Duke-NUS Medical School, Singapore, Singapore.
¹⁰ Institute of Data Science, National University of Singapore, Singapore, Singapore.
¹¹ National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States.
¹² Department of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, United States.
¹³ Information Technology Center, University of Tokyo, Kashiwa, Japan.
¹⁴ Smartor LLC, Tokyo, Japan.

^# Contributed equally.

PMID: 39361955
PMCID: PMC11487205
DOI: 10.2196/60601

Abstract

Background: Medical texts present significant domain-specific challenges, and manually curating these texts is a time-consuming and labor-intensive process. To address this, natural language processing (NLP) algorithms have been developed to automate text processing. In the biomedical field, various toolkits for text processing exist, which have greatly improved the efficiency of handling unstructured text. However, these existing toolkits tend to emphasize different perspectives, and none of them offer generation capabilities, leaving a significant gap in the current offerings.

Objective: This study aims to describe the development and preliminary evaluation of Ascle. Ascle is tailored for biomedical researchers and clinical staff with an easy-to-use, all-in-one solution that requires minimal programming expertise. For the first time, Ascle provides 4 advanced and challenging generative functions: question-answering, text summarization, text simplification, and machine translation. In addition, Ascle integrates 12 essential NLP functions, along with query and search capabilities for clinical databases.

Methods: We fine-tuned 32 domain-specific language models and evaluated them thoroughly on 27 established benchmarks. In addition, for the question-answering task, we developed a retrieval-augmented generation (RAG) framework for large language models that incorporated a medical knowledge graph with ranking techniques to enhance the reliability of generated answers. Additionally, we conducted a physician validation to assess the quality of generated content beyond automated metrics.

Results: The fine-tuned models and RAG framework consistently enhanced text generation tasks. For example, the fine-tuned models improved the machine translation task by 20.27 in terms of BLEU score. In the question-answering task, the RAG framework raised the ROUGE-L score by 18% over the vanilla models. Physician validation of generated answers showed high scores for readability (4.95/5) and relevancy (4.43/5), with a lower score for accuracy (3.90/5) and completeness (3.31/5).

Conclusions: This study introduces the development and evaluation of Ascle, a user-friendly NLP toolkit designed for medical text generation. All code is publicly available through the Ascle GitHub repository. All fine-tuned language models can be accessed through Hugging Face.

Keywords: deep learning; generative artificial intelligence; healthcare; large language models; machine learning; natural language processing; retrieval-augmented generation.

©Rui Yang, Qingcheng Zeng, Keen You, Yujie Qiao, Lucas Huang, Chia-Chun Hsieh, Benjamin Rosand, Jeremy Goldwasser, Amisha Dave, Tiarnan Keenan, Yuhe Ke, Chuan Hong, Nan Liu, Emily Chew, Dragomir Radev, Zhiyong Lu, Hua Xu, Qingyu Chen, Irene Li. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 03.10.2024.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 1**
The overall architecture of Ascle. indicates that we have our fine-tuned language models for this task. indicates that we conducted evaluations for this task. POS: Parts-Of-Speech; QA: Question-Answering; UMLS: Unified Medical Language System.

formula image — **Figure 1**
The overall architecture of Ascle. indicates that we have our fine-tuned language models for this task. indicates that we conducted evaluations for this task. POS: Parts-Of-Speech; QA: Question-Answering; UMLS: Unified Medical Language System.

**Figure 2**
(A) Physician validation (readability, relevancy, accuracy, and completeness) for 50 question-answer pairs. (B) Two examples of generated answers with ground truth.

**Figure 3**
Demonstration of system usage. We show two use cases: Text Simplification and Machine Translation.

See this image and copyright information in PMC

Update of

Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation.
Yang R, Zeng Q, You K, Qiao Y, Huang L, Hsieh CC, Rosand B, Goldwasser J, Dave AD, Keenan TDL, Chew EY, Radev D, Lu Z, Xu H, Chen Q, Li I. Yang R, et al. ArXiv [Preprint]. 2023 Dec 9:arXiv:2311.16588v2. ArXiv. 2023. Update in: J Med Internet Res. 2024 Oct 3;26:e60601. doi: 10.2196/60601. PMID: 41031083 Free PMC article. Updated. Preprint.

References

1. Li I, Yasunaga M, Nuzumlalı MY, Caraballo C, Mahajan S, Krumholz H, Radev D. A neural topic-attention model for medical term abbreviation disambiguation. ArXiv. Preprint posted online on October 30, 2019. 2019 doi: 10.5260/chara.21.2.8. https://arxiv.org/abs/1910.14076 - DOI
1. Li I, Pan J, Goldwasser J, Verma N, Wong WP, Nuzumlalı MY, Rosand B, Li Y, Zhang M, Chang D, Taylor RA, Krumholz HM, Radev D. Neural natural language processing for unstructured data in electronic health records: a review. Computer Science Review. 2022;46:100511. doi: 10.1016/j.cosrev.2022.100511. - DOI
1. Shickel B, Tighe PJ, Bihorac A, Rashidi P. Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. IEEE J Biomed Health Inform. 2018;22(5):1589–1604. doi: 10.1109/JBHI.2017.2767063. https://europepmc.org/abstract/MED/29989977 - DOI - PMC - PubMed
1. al-Aiad A, Duwairi R, Fraihat M. Survey: deep learning concepts and techniques for electronic health record. IEEE/ACS 15th International Conference on Computer Systems and Applications (AICCSA); 2018 October 28 - 2018 November 01; Aqaba, Jordan. IEEE; 2018. pp. 1–5. - DOI
1. Zhang Y, Chen Q, Yang Z, Lin H, Lu Z. BioWordVec, improving biomedical word embeddings with subword information and MeSH. Sci Data. 2019;6(1):52. doi: 10.1038/s41597-019-0055-0. http://europepmc.org/abstract/MED/31076572 10.1038/s41597-019-0055-0 - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- JMIR Publications
- PubMed Central
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Ascle-A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study

Affiliations

Ascle-A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Update of

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous