Machine learning in medicine: a practical introduction to natural language processing

BMC Med Res Methodol. 2021 Jul 31;21(1):158.

Conrad J Harrison¹, Chris J Sidey-Gibbons²

Affiliations

¹ Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK. conrad.harrison@medsci.ox.ac.uk.
² MD Anderson Center for INSPiRED Cancer Care, Department of Symptom Research, University of Texas MD Anderson Cancer Center, Houston, TX, USA.

PMID: 34332525
PMCID: PMC8325804
DOI: 10.1186/s12874-021-01347-1

Conrad J Harrison et al. BMC Med Res Methodol. 2021.

Abstract

Background: Unstructured text, including medical records, patient feedback, and social media comments, can be a rich source of data for clinical research. Natural language processing (NLP) describes a set of techniques used to convert passages of written text into interpretable datasets that can be analysed by statistical and machine learning (ML) models. The purpose of this paper is to provide a practical introduction to contemporary techniques for the analysis of text data, using freely available software.

Methods: We performed three NLP experiments using publicly available data obtained from medicine review websites. First, we conducted lexicon-based sentiment analysis on open-text patient reviews of four drugs: Levothyroxine, Viagra, Oseltamivir and Apixaban. Next, we used unsupervised ML (latent Dirichlet allocation, LDA) to identify similar drugs in the dataset, based solely on their reviews. Finally, we developed three supervised ML algorithms to predict whether a drug review was associated with a positive or negative rating. These algorithms were: a regularised logistic regression, a support vector machine (SVM), and an artificial neural network (ANN). We compared the performance of these algorithms in terms of classification accuracy, area under the receiver operating characteristic curve (AUC), sensitivity and specificity.
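
As an illustration of the first experiment type, lexicon-based sentiment analysis can be sketched in a few lines of Python. This is a minimal sketch, not the authors' code: the eight-word `LEXICON`, the example reviews, and the function names are all hypothetical, and a real analysis would use a published sentiment lexicon with thousands of entries.

```python
import re
from collections import Counter

# Hypothetical toy lexicon mapping words to a sentiment label.
# Real analyses rely on published lexicons; this one is for illustration only.
LEXICON = {
    "good": "positive", "great": "positive",
    "effective": "positive", "relief": "positive",
    "bad": "negative", "awful": "negative",
    "nausea": "negative", "worse": "negative",
}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def sentiment_counts(review):
    """Return (positive, negative) counts of lexicon words in one review."""
    counts = Counter(LEXICON[t] for t in tokenize(review) if t in LEXICON)
    return counts["positive"], counts["negative"]


def positive_proportion(reviews):
    """Proportion of matched lexicon words that are positive, or None if no matches."""
    positive = negative = 0
    for review in reviews:
        p, n = sentiment_counts(review)
        positive += p
        negative += n
    total = positive + negative
    return positive / total if total else None


# Made-up example reviews (not taken from the study data).
example_reviews = [
    "Great relief, very effective.",
    "Awful nausea and I felt worse.",
]
print(positive_proportion(example_reviews))
```

Words absent from the lexicon are simply ignored, which is the main limitation of this approach: negation ("not good") and domain-specific vocabulary are not handled in this sketch.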

Results: Levothyroxine and Viagra were reviewed with a higher proportion of positive sentiments than Oseltamivir and Apixaban. One of the three LDA clusters clearly represented drugs used to treat mental health problems. A common theme suggested by this cluster was drugs taking weeks or months to work. Another cluster clearly represented drugs used as contraceptives. Supervised ML algorithms predicted positive or negative drug ratings with classification accuracies ranging from 0.664, 95% CI [0.608, 0.716] for the regularised regression to 0.720, 95% CI [0.664, 0.776] for the SVM.
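
The performance measures reported above can be computed from predicted and true labels as in the following minimal sketch. The labels are made up for illustration, the function names are hypothetical, and the Wilson score interval is an assumption: the abstract does not state which method the authors used for their 95% confidence intervals.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a proportion (assumed CI method; z=1.96 gives ~95%)."""
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half_width, centre + half_width


def classification_summary(y_true, y_pred):
    """Accuracy with a confidence interval, plus sensitivity and specificity."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    n = tp + tn + fp + fn
    return {
        "accuracy": (tp + tn) / n,
        "accuracy_ci": wilson_interval(tp + tn, n),
        "sensitivity": tp / (tp + fn),  # proportion of true positives detected
        "specificity": tn / (tn + fp),  # proportion of true negatives detected
    }


# Made-up labels for illustration only (1 = positive rating, 0 = negative rating).
true_labels = [1, 1, 1, 0, 0, 0, 1, 0]
predicted_labels = [1, 1, 0, 0, 0, 1, 1, 0]
summary = classification_summary(true_labels, predicted_labels)
print(summary)
```

AUC is omitted here because it requires predicted probabilities or scores rather than hard class labels.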

Conclusions: In this paper, we present a conceptual overview of common techniques used to analyse large volumes of text, and provide reproducible code that can be readily applied to other research studies using open-source software.

Conflict of interest statement

The authors have no competing interests to declare in relation to this work.

Figures

**Fig. 1**
Creating a document term matrix from the data

**Fig. 2**
A part of the document term matrix

**Fig. 3**
Latent Dirichlet allocation can be performed with a short passage of code

**Fig. 4**
Splitting data into training and test sets
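
The figures themselves are not reproduced on this page. As a rough idea of the two steps named in the captions of Fig. 1 and Fig. 4, the following minimal sketch builds a document term matrix and splits data into training and test sets using only the Python standard library. It is an independent illustration rather than the authors' code: the function names, the 25% test fraction, the seed, and the example reviews are all assumptions.

```python
import random
import re


def build_document_term_matrix(documents):
    """Return (vocabulary, matrix), where matrix[i][j] counts term j in document i."""
    tokenized = [re.findall(r"[a-z']+", doc.lower()) for doc in documents]
    vocabulary = sorted({term for doc in tokenized for term in doc})
    column = {term: j for j, term in enumerate(vocabulary)}
    matrix = [[0] * len(vocabulary) for _ in documents]
    for i, doc in enumerate(tokenized):
        for term in doc:
            matrix[i][column[term]] += 1
    return vocabulary, matrix


def split_train_test(rows, labels, test_fraction=0.25, seed=0):
    """Shuffle row indices reproducibly, then hold out a fraction as the test set."""
    order = list(range(len(rows)))
    random.Random(seed).shuffle(order)
    n_test = max(1, round(len(rows) * test_fraction))
    test_idx, train_idx = order[:n_test], order[n_test:]
    train = ([rows[i] for i in train_idx], [labels[i] for i in train_idx])
    test = ([rows[i] for i in test_idx], [labels[i] for i in test_idx])
    return train, test


# Made-up reviews and ratings for illustration (1 = positive, 0 = negative).
example_docs = [
    "It works",
    "It works, it really works",
    "Made me dizzy",
    "No side effects",
]
example_labels = [1, 1, 0, 1]

vocabulary, dtm = build_document_term_matrix(example_docs)
(train_rows, train_labels), (test_rows, test_labels) = split_train_test(dtm, example_labels)
print(vocabulary)
print(len(train_rows), len(test_rows))
```

Each row of the matrix represents one review and each column one term, so the same matrix can serve as input to both topic models such as LDA and supervised classifiers.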

Cited by

Urban resilience to socioeconomic disruptions during the COVID-19 pandemic: Evidence from China.
Yuan Z, Hu W. Int J Disaster Risk Reduct. 2023 Jun 1;91:103670. doi: 10.1016/j.ijdrr.2023.103670. Epub 2023 Apr 5. PMID: 37041883.
Artificial Intelligence for Clinical Management of Male Infertility, a Scoping Review.
Naik N, Roth B, Lundy SD. Curr Urol Rep. 2024 Nov 9;26(1):17. doi: 10.1007/s11934-024-01239-z. PMID: 39520645.
Externally validated and clinically useful machine learning algorithms to support patient-related decision-making in oncology: a scoping review.
Santos CS, Amorim-Lopes M. BMC Med Res Methodol. 2025 Feb 21;25(1):45. doi: 10.1186/s12874-025-02463-y. PMID: 39984835.
Leveraging Artificial Intelligence and Data Science for Integration of Social Determinants of Health in Emergency Medicine: Scoping Review.
Abbott EE, Apakama D, Richardson LD, Chan L, Nadkarni GN. JMIR Med Inform. 2024 Oct 30;12:e57124. doi: 10.2196/57124. PMID: 39475815.
A case study on generative artificial intelligence to extract the fundamental sleep parameters from polysomnography notes.
Maghsoudi A, Sharafkhaneh A, Azarian M, Ramezani A, Hirshkowitz M, Razjouyan J. J Clin Sleep Med. 2025 Jun 1;21(6):1123-1127. doi: 10.5664/jcsm.11594. PMID: 40012317.
