Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jul 14;28(7):1468-1479.
doi: 10.1093/jamia/ocab027.

ACE: the Advanced Cohort Engine for searching longitudinal patient records

Affiliations

ACE: the Advanced Cohort Engine for searching longitudinal patient records

Alison Callahan et al. J Am Med Inform Assoc. .

Abstract

Objective: To propose a paradigm for a scalable time-aware clinical data search, and to describe the design, implementation and use of a search engine realizing this paradigm.

Materials and methods: The Advanced Cohort Engine (ACE) uses a temporal query language and in-memory datastore of patient objects to provide a fast, scalable, and expressive time-aware search. ACE accepts data in the Observational Medicine Outcomes Partnership Common Data Model, and is configurable to balance performance with compute cost. ACE's temporal query language supports automatic query expansion using clinical knowledge graphs. The ACE API can be used with R, Python, Java, HTTP, and a Web UI.

Results: ACE offers an expressive query language for complex temporal search across many clinical data types with multiple output options. ACE enables electronic phenotyping and cohort-building with subsecond response times in searching the data of millions of patients for a variety of use cases.

Discussion: ACE enables fast, time-aware search using a patient object-centric datastore, thereby overcoming many technical and design shortcomings of relational algebra-based querying. Integrating electronic phenotype development with cohort-building enables a variety of high-value uses for a learning health system. Tradeoffs include the need to learn a new query language and the technical setup burden.

Conclusion: ACE is a tool that combines a unique query language for time-aware search of longitudinal patient records with a patient object datastore for rapid electronic phenotyping, cohort extraction, and exploratory data analyses.

Keywords: data science; electronic health records; in-memory datastore, query language, search engine.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Summary view showing the number of patients meeting the search criteria, a summary of their demographics (histograms of age, race/ethnicity, and length of record), and their most frequently occurring diagnosis, procedure, medication, and laboratory test records.
Figure 2.
Figure 2.
Patient timeline view, displaying each patient as a row and showing the time intervals where a given search criterion was satisfied in different colors. For example, glipizide prescription records following type II diabetes diagnosis are shown in green, and subsequent stroke events are shown in pink.

References

    1. Palmer RH. Process-based measures of quality: the need for detailed clinical data in large health care databases. Ann Intern Med 1997; 127 (8_Part_2): 733–8. - PubMed
    1. Longhurst CA, Harrington RA, Shah NH.. A ‘green button’ for using aggregate patient data at the point of care. Health Aff 2014; 33 (7): 1229–35. - PubMed
    1. Hripcsak G, Ryan PB, Duke JD, et al.Characterizing treatment pathways at scale using the OHDSI network. Proc Natl Acad Sci USA 2016; 6: 7329–36. doi: 10.1073/pnas.1510502113. - DOI - PMC - PubMed
    1. Greenes RA, Pappalardo AN, Marble CW, et al.Design and implementation of a clinical data management system. Comput Biomed Res 1969; 2 (5): 469–85. - PubMed
    1. Safran C, Porter D, Rury CD, et al.ClinQuery: searching a large clinical database. MD Comput 1990; 7 (3): 144–53. - PubMed

Publication types