Automating case definitions using literature-based reasoning
- PMID: 24454579
- PMCID: PMC3885912
- DOI: 10.4338/ACI-2013-04-RA-0028
Automating case definitions using literature-based reasoning
Abstract
Background: Establishing a Case Definition (CDef) is a first step in many epidemiological, clinical, surveillance, and research activities. The application of CDefs still relies on manual steps and this is a major source of inefficiency in surveillance and research.
Objective: Describe the need and propose an approach for automating the useful representation of CDefs for medical conditions.
Methods: We translated the existing Brighton Collaboration CDef for anaphylaxis by mostly relying on the identification of synonyms for the criteria of the CDef using the NLM MetaMap tool. We also generated a CDef for the same condition using all the related PubMed abstracts, processing them with a text mining tool, and further treating the synonyms with the above strategy. The co-occurrence of the anaphylaxis and any other medical term within the same sentence of the abstracts supported the construction of a large semantic network. The 'islands' algorithm reduced the network and revealed its densest region including the nodes that were used to represent the key criteria of the CDef. We evaluated the ability of the "translated" and the "generated" CDef to classify a set of 6034 H1N1 reports for anaphylaxis using two similarity approaches and comparing them with our previous semi-automated classification approach.
Results: Overall classification performance across approaches to producing CDefs was similar, with the generated CDef and vector space model with cosine similarity having the highest accuracy (0.825 ± 0.003) and the semi-automated approach and vector space model with cosine similarity having the highest recall (0.809 ± 0.042). Precision was low for all approaches.
Conclusion: The useful representation of CDefs is a complicated task but potentially offers substantial gains in efficiency to support safety and clinical surveillance.
Keywords: Case definition; anaphylaxis; literature-based reasoning; safety surveillance; semantic networks; similarity.
Figures


Similar articles
-
Application of information retrieval approaches to case classification in the vaccine adverse event reporting system.Drug Saf. 2013 Jul;36(7):573-82. doi: 10.1007/s40264-013-0064-4. Drug Saf. 2013. PMID: 23703591
-
Multiscale Monte Carlo simulations of gold nanoparticle dose-enhanced radiotherapy II. Cellular dose enhancement within macroscopic tumor models.Med Phys. 2023 Sep;50(9):5842-5852. doi: 10.1002/mp.16460. Epub 2023 May 29. Med Phys. 2023. PMID: 37246723
-
Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection.J Am Med Inform Assoc. 2011 Sep-Oct;18(5):631-8. doi: 10.1136/amiajnl-2010-000022. Epub 2011 Jun 27. J Am Med Inform Assoc. 2011. PMID: 21709163 Free PMC article.
-
Graph-based biomedical text summarization: An itemset mining and sentence clustering approach.J Biomed Inform. 2018 Aug;84:42-58. doi: 10.1016/j.jbi.2018.06.005. Epub 2018 Jun 15. J Biomed Inform. 2018. PMID: 29906584
-
SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions.Artif Intell Med. 2020 Jan;102:101767. doi: 10.1016/j.artmed.2019.101767. Epub 2019 Nov 28. Artif Intell Med. 2020. PMID: 31980104
Cited by
-
Improving Methods of Identifying Anaphylaxis for Medical Product Safety Surveillance Using Natural Language Processing and Machine Learning.Am J Epidemiol. 2023 Feb 1;192(2):283-295. doi: 10.1093/aje/kwac182. Am J Epidemiol. 2023. PMID: 36331289 Free PMC article.
-
Use of data mining at the Food and Drug Administration.J Am Med Inform Assoc. 2016 Mar;23(2):428-34. doi: 10.1093/jamia/ocv063. Epub 2015 Jul 23. J Am Med Inform Assoc. 2016. PMID: 26209436 Free PMC article. Review.
-
"Artificial Intelligence" for Pharmacovigilance: Ready for Prime Time?Drug Saf. 2022 May;45(5):429-438. doi: 10.1007/s40264-022-01157-4. Epub 2022 May 17. Drug Saf. 2022. PMID: 35579808 Free PMC article.
-
Monitoring biomedical literature for post-market safety purposes by analyzing networks of text-based coded information.AMIA Jt Summits Transl Sci Proc. 2017 Jul 26;2017:66-75. eCollection 2017. AMIA Jt Summits Transl Sci Proc. 2017. PMID: 28815108 Free PMC article.
References
-
- Merrill R. Introduction to Epidemiology. 5th ed Jones & Bartlett Learning;2010
-
- Ghanaie RM, Karimi A, Sadeghi H, Esteghamti A, Falah F, Armin S, Fahimzad A, Shamshiri A, Kahbazi M, Shiva F. Sensitivity and specificity of the World Health Organization pertussis clinical case definition. International Journal of Infectious Diseases 2010; 14(12): e1072–e1075 - PubMed
-
- CDC National Notifiable Diseases Surveillance System (NNDSS). December 7, 2012. Available from: http://wwwn.cdc.gov/nndss/.
-
- Koo D, Wharton M, Birkhead G. Case Definitions for Infectious Conditions Under Public Health Surveillance. MMWR Recomm Rep 1997; 46(RR-10): 1–64 - PubMed
-
- Wharton M, Chorba TL, Vogt RL, Morse DL, Buehler JW. Case definitions for public health surveillance. MMWR Recomm Rep 1990; 39(RR-13): 1–43 - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources