ACE: the Advanced Cohort Engine for searching longitudinal patient records
- PMID: 33712854
- PMCID: PMC8279796
- DOI: 10.1093/jamia/ocab027
ACE: the Advanced Cohort Engine for searching longitudinal patient records
Abstract
Objective: To propose a paradigm for a scalable time-aware clinical data search, and to describe the design, implementation and use of a search engine realizing this paradigm.
Materials and methods: The Advanced Cohort Engine (ACE) uses a temporal query language and in-memory datastore of patient objects to provide a fast, scalable, and expressive time-aware search. ACE accepts data in the Observational Medicine Outcomes Partnership Common Data Model, and is configurable to balance performance with compute cost. ACE's temporal query language supports automatic query expansion using clinical knowledge graphs. The ACE API can be used with R, Python, Java, HTTP, and a Web UI.
Results: ACE offers an expressive query language for complex temporal search across many clinical data types with multiple output options. ACE enables electronic phenotyping and cohort-building with subsecond response times in searching the data of millions of patients for a variety of use cases.
Discussion: ACE enables fast, time-aware search using a patient object-centric datastore, thereby overcoming many technical and design shortcomings of relational algebra-based querying. Integrating electronic phenotype development with cohort-building enables a variety of high-value uses for a learning health system. Tradeoffs include the need to learn a new query language and the technical setup burden.
Conclusion: ACE is a tool that combines a unique query language for time-aware search of longitudinal patient records with a patient object datastore for rapid electronic phenotyping, cohort extraction, and exploratory data analyses.
Keywords: data science; electronic health records; in-memory datastore, query language, search engine.
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association.
Figures
References
-
- Palmer RH. Process-based measures of quality: the need for detailed clinical data in large health care databases. Ann Intern Med 1997; 127 (8_Part_2): 733–8. - PubMed
-
- Longhurst CA, Harrington RA, Shah NH.. A ‘green button’ for using aggregate patient data at the point of care. Health Aff 2014; 33 (7): 1229–35. - PubMed
-
- Greenes RA, Pappalardo AN, Marble CW, et al.Design and implementation of a clinical data management system. Comput Biomed Res 1969; 2 (5): 469–85. - PubMed
-
- Safran C, Porter D, Rury CD, et al.ClinQuery: searching a large clinical database. MD Comput 1990; 7 (3): 144–53. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous
