Information discovery on electronic health records using authority flow techniques
- PMID: 20969780
- PMCID: PMC2984470
- DOI: 10.1186/1472-6947-10-64
Information discovery on electronic health records using authority flow techniques
Abstract
Background: As the use of electronic health records (EHRs) becomes more widespread, so does the need to search and provide effective information discovery within them. Querying by keyword has emerged as one of the most effective paradigms for searching. Most work in this area is based on traditional Information Retrieval (IR) techniques, where each document is compared individually against the query. We compare the effectiveness of two fundamentally different techniques for keyword search of EHRs.
Methods: We built two ranking systems. The traditional BM25 system exploits the EHRs' content without regard to association among entities within. The Clinical ObjectRank (CO) system exploits the entities' associations in EHRs using an authority-flow algorithm to discover the most relevant entities. BM25 and CO were deployed on an EHR dataset of the cardiovascular division of Miami Children's Hospital. Using sequences of keywords as queries, sensitivity and specificity were measured by two physicians for a set of 11 queries related to congenital cardiac disease.
Results: Our pilot evaluation showed that CO outperforms BM25 in terms of sensitivity (65% vs. 38%) by 71% on average, while maintaining the specificity (64% vs. 61%). The evaluation was done by two physicians.
Conclusions: Authority-flow techniques can greatly improve the detection of relevant information in EHRs and hence deserve further study.
Figures






References
-
- Robertson SE, Walker S, Jones S, Proceedings of the Text Retrieval Conference (TREC) Gaithersburg; 1994. Okapi at TREC-3; pp. 109–126. (4,6)
-
- Singhal A. Modern Information Retrieval: A Brief Overview. Proceedings of IEEE Data Engineering Bulletin. 2001;24(4):35–43. (5)
-
- Robertson SE, Walker S, Beaulieu M. Okapi at TREC-7: automatic ad hoc, filtering, VLC and filtering tracks. Proceedings of the Seventh Text REtrieval Conference (TREC-7) 1999. pp. 253–264. (5)
-
- Singhal A, Buckley C, Mitra M. Proceedings of Association for Computing Machinery Special Interest Group in Information Retrieval (SIGIR) New York; 1996. Pivoted document length normalization; pp. 21–29. (5)
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials