Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals
- PMID: 36001371
- PMCID: PMC9453582
- DOI: 10.2196/38122
Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals
Abstract
Background: As more health care organizations transition to using electronic health record (EHR) systems, it is important for these organizations to maximize the secondary use of their data to support service improvement and clinical research. These organizations will find it challenging to have systems capable of harnessing the unstructured data fields in the record (clinical notes, letters, etc) and more practically have such systems interact with all of the hospital data systems (legacy and current).
Objective: We describe the deployment of the EHR interfacing information extraction and retrieval platform CogStack at University College London Hospitals (UCLH).
Methods: At UCLH, we have deployed the CogStack platform, an information retrieval platform with natural language processing capabilities. The platform addresses the problem of data ingestion and harmonization from multiple data sources using the Apache NiFi module for managing complex data flows. The platform also facilitates the extraction of structured data from free-text records through use of the MedCAT natural language processing library. Finally, data science tools are made available to support data scientists and the development of downstream applications dependent upon data ingested and analyzed by CogStack.
Results: The platform has been deployed at the hospital, and in particular, it has facilitated a number of research and service evaluation projects. To date, we have processed over 30 million records, and the insights produced from CogStack have informed a number of clinical research use cases at the hospital.
Conclusions: The CogStack platform can be configured to handle the data ingestion and harmonization challenges faced by a hospital. More importantly, the platform enables the hospital to unlock important clinical information from the unstructured portion of the record using natural language processing technology.
Keywords: clinical support; electronic health record system; information retrieval; natural language processing; text mining.
©Kawsar Noor, Lukasz Roguski, Xi Bai, Alex Handy, Roman Klapaukh, Amos Folarin, Luis Romao, Joshua Matteson, Nathan Lea, Leilei Zhu, Folkert W Asselbergs, Wai Keong Wong, Anoop Shah, Richard JB Dobson. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 24.08.2022.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
Similar articles
-
SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research.J Am Med Inform Assoc. 2018 May 1;25(5):530-537. doi: 10.1093/jamia/ocx160. J Am Med Inform Assoc. 2018. PMID: 29361077 Free PMC article.
-
CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital.BMC Med Inform Decis Mak. 2018 Jun 25;18(1):47. doi: 10.1186/s12911-018-0623-9. BMC Med Inform Decis Mak. 2018. PMID: 29941004 Free PMC article.
-
Multi-domain clinical natural language processing with MedCAT: The Medical Concept Annotation Toolkit.Artif Intell Med. 2021 Jul;117:102083. doi: 10.1016/j.artmed.2021.102083. Epub 2021 May 1. Artif Intell Med. 2021. PMID: 34127232
-
Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review.JMIR Med Inform. 2019 Apr 27;7(2):e12239. doi: 10.2196/12239. JMIR Med Inform. 2019. PMID: 31066697 Free PMC article. Review.
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
Cited by
-
A survey on clinical natural language processing in the United Kingdom from 2007 to 2022.NPJ Digit Med. 2022 Dec 21;5(1):186. doi: 10.1038/s41746-022-00730-6. NPJ Digit Med. 2022. PMID: 36544046 Free PMC article. Review.
-
Concept Recognition and Characterization of Patients Undergoing Resection of Vestibular Schwannoma Using Natural Language Processing.J Neurol Surg B Skull Base. 2024 May 11;86(3):332-341. doi: 10.1055/s-0044-1786738. eCollection 2025 Jun. J Neurol Surg B Skull Base. 2024. PMID: 40351873 Free PMC article.
-
Early evaluation of a natural language processing tool to improve access to educational resources for surgical patients.Eur Spine J. 2024 Jul;33(7):2545-2552. doi: 10.1007/s00586-024-08315-5. Epub 2024 May 30. Eur Spine J. 2024. PMID: 38811438 Free PMC article.
-
Breaking Bias: The Role of Artificial Intelligence in Improving Clinical Decision-Making.Cureus. 2023 Mar 20;15(3):e36415. doi: 10.7759/cureus.36415. eCollection 2023 Mar. Cureus. 2023. PMID: 37090406 Free PMC article.
-
Neural machine translation of clinical text: an empirical investigation into multilingual pre-trained language models and transfer-learning.Front Digit Health. 2024 Feb 26;6:1211564. doi: 10.3389/fdgth.2024.1211564. eCollection 2024. Front Digit Health. 2024. PMID: 38468693 Free PMC article.
References
-
- Department of Health . Delivering 21st Century IT Support for the NHS: National Strategic Programme. London, UK: Department of Health; 2002.
-
- Sheikh A, Cornford T, Barber N, Avery A, Takian A, Lichtner V, Petrakaki D, Crowe S, Marsden K, Robertson A, Morrison Z, Klecun E, Prescott R, Quinn C, Jani Y, Ficociello M, Voutsina K, Paton J, Fernando B, Jacklin A, Cresswell K. Implementation and adoption of nationwide electronic health records in secondary care in England: final qualitative results from prospective national evaluation in "early adopter" hospitals. BMJ. 2011 Oct 17;343:d6054. doi: 10.1136/bmj.d6054. http://europepmc.org/abstract/MED/22006942 - DOI - PMC - PubMed
-
- Why Unstructured Data Holds the Key to Intelligent Healthcare Systems. HIT Consultant. 2015. [2022-07-08]. https://hitconsultant.net/2015/03/31/tapping-unstructured-data-healthcar...
-
- Poulos J, Zhu L, Shah AD. Data gaps in electronic health record (EHR) systems: An audit of problem list completeness during the COVID-19 pandemic. Int J Med Inform. 2021 Jun;150:104452. doi: 10.1016/j.ijmedinf.2021.104452. https://linkinghub.elsevier.com/retrieve/pii/S1386-5056(21)00078-2 S1386-5056(21)00078-2 - DOI - PMC - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials