Artificial intelligence-based mining of electronic health record data to accelerate the digital transformation of the national cardiovascular ecosystem: design protocol of the CardioMining study
- PMID: 37012018
- PMCID: PMC10083759
- DOI: 10.1136/bmjopen-2022-068698
Artificial intelligence-based mining of electronic health record data to accelerate the digital transformation of the national cardiovascular ecosystem: design protocol of the CardioMining study
Abstract
Introduction: Mining of electronic health record (EHRs) data is increasingly being implemented all over the world but mainly focuses on structured data. The capabilities of artificial intelligence (AI) could reverse the underusage of unstructured EHR data and enhance the quality of medical research and clinical care. This study aims to develop an AI-based model to transform unstructured EHR data into an organised, interpretable dataset and form a national dataset of cardiac patients.
Methods and analysis: CardioMining is a retrospective, multicentre study based on large, longitudinal data obtained from unstructured EHRs of the largest tertiary hospitals in Greece. Demographics, hospital administrative data, medical history, medications, laboratory examinations, imaging reports, therapeutic interventions, in-hospital management and postdischarge instructions will be collected, coupled with structured prognostic data from the National Institute of Health. The target number of included patients is 100 000. Natural language processing techniques will facilitate data mining from the unstructured EHRs. The accuracy of the automated model will be compared with the manual data extraction by study investigators. Machine learning tools will provide data analytics. CardioMining aims to cultivate the digital transformation of the national cardiovascular system and fill the gap in medical recording and big data analysis using validated AI techniques.
Ethics and dissemination: This study will be conducted in keeping with the International Conference on Harmonisation Good Clinical Practice guidelines, the Declaration of Helsinki, the Data Protection Code of the European Data Protection Authority and the European General Data Protection Regulation. The Research Ethics Committee of the Aristotle University of Thessaloniki and Scientific and Ethics Council of the AHEPA University Hospital have approved this study. Study findings will be disseminated through peer-reviewed medical journals and international conferences. International collaborations with other cardiovascular registries will be attempted.
Trial registration number: NCT05176769.
Keywords: CARDIOLOGY; Health informatics; Heart failure; Ischaemic heart disease; Risk management.
© Author(s) (or their employer(s)) 2023. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.
Conflict of interest statement
Competing interests: None declared.
Figures



Similar articles
-
Real world evidence in cardiovascular medicine: ensuring data validity in electronic health record-based studies.J Am Med Inform Assoc. 2019 Nov 1;26(11):1189-1194. doi: 10.1093/jamia/ocz119. J Am Med Inform Assoc. 2019. PMID: 31414700 Free PMC article.
-
A method for cohort selection of cardiovascular disease records from an electronic health record system.Int J Med Inform. 2017 Jun;102:138-149. doi: 10.1016/j.ijmedinf.2017.03.015. Epub 2017 Mar 30. Int J Med Inform. 2017. PMID: 28495342
-
Using artificial intelligence to identify patients with migraine and associated symptoms and conditions within electronic health records.BMC Med Inform Decis Mak. 2023 Jul 14;23(1):121. doi: 10.1186/s12911-023-02190-8. BMC Med Inform Decis Mak. 2023. PMID: 37452338 Free PMC article.
-
Applications of Artificial Intelligence to Electronic Health Record Data in Ophthalmology.Transl Vis Sci Technol. 2020 Feb 27;9(2):13. doi: 10.1167/tvst.9.2.13. Transl Vis Sci Technol. 2020. PMID: 32704419 Free PMC article. Review.
-
Artificial intelligence approaches using natural language processing to advance EHR-based clinical research.J Allergy Clin Immunol. 2020 Feb;145(2):463-469. doi: 10.1016/j.jaci.2019.12.897. Epub 2019 Dec 26. J Allergy Clin Immunol. 2020. PMID: 31883846 Free PMC article. Review.
Cited by
-
Integrating the Polysocial Risk Score: Enhancing Comprehensive Healthcare Delivery.Methodist Debakey Cardiovasc J. 2024 Nov 5;20(5):89-97. doi: 10.14797/mdcvj.1479. eCollection 2024. Methodist Debakey Cardiovasc J. 2024. PMID: 39525375 Free PMC article. Review.
-
Ethical and privacy challenges of integrating generative AI into EHR systems in Tanzania: A scoping review with a policy perspective.Digit Health. 2025 May 20;11:20552076251344385. doi: 10.1177/20552076251344385. eCollection 2025 Jan-Dec. Digit Health. 2025. PMID: 40400763 Free PMC article.
-
Facilitators and Barriers to Uptake of Drug-Drug Interaction Alerts: Perspectives of Australian End Users and Managers.Appl Clin Inform. 2025 Mar;16(2):295-304. doi: 10.1055/a-2481-4221. Epub 2025 Apr 2. Appl Clin Inform. 2025. PMID: 40174880
-
Integrating Omics Data and AI for Cancer Diagnosis and Prognosis.Cancers (Basel). 2024 Jul 3;16(13):2448. doi: 10.3390/cancers16132448. Cancers (Basel). 2024. PMID: 39001510 Free PMC article. Review.
-
Considerations and Challenges When Using Clinical and Vital Record Review for Suicide Research.J Patient Saf. 2025 Apr 1;21(3):e8-e17. doi: 10.1097/PTS.0000000000001325. Epub 2025 Feb 11. J Patient Saf. 2025. PMID: 39927831 No abstract available.
References
MeSH terms
Associated data
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous