Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Dec;18 Suppl 1(Suppl 1):i109-15.
doi: 10.1136/amiajnl-2011-000463. Epub 2011 Nov 23.

Exploiting time in electronic health record correlations

Affiliations

Exploiting time in electronic health record correlations

George Hripcsak et al. J Am Med Inform Assoc. 2011 Dec.

Abstract

Objective: To demonstrate that a large, heterogeneous clinical database can reveal fine temporal patterns in clinical associations; to illustrate several types of associations; and to ascertain the value of exploiting time.

Materials and methods: Lagged linear correlation was calculated between seven clinical laboratory values and 30 clinical concepts extracted from resident signout notes from a 22-year, 3-million-patient database of electronic health records. Time points were interpolated, and patients were normalized to reduce inter-patient effects.

Results: The method revealed several types of associations with detailed temporal patterns. Definitional associations included low blood potassium preceding 'hypokalemia.' Low potassium preceding the drug spironolactone with high potassium following spironolactone exemplified intentional and physiologic associations, respectively. Counterintuitive results such as the fact that diseases appeared to follow their effects may be due to the workflow of healthcare, in which clinical findings precede the clinician's diagnosis of a disease even though the disease actually preceded the findings. Fully exploiting time by interpolating time points produced less noisy results.

Discussion: Electronic health records are not direct reflections of the patient state, but rather reflections of the healthcare process and the recording process. With proper techniques and understanding, and with proper incorporation of time, interpretable associations can be derived from a large clinical database.

Conclusion: A large, heterogeneous clinical database can reveal clinical associations, time is an important feature, and care must be taken to interpret the results.

PubMed Disclaimer

Conflict of interest statement

Competing interests: None.

Figures

Figure 1
Figure 1
Temporal interpolation. For a given patient, each measured point (solid circles on both curves) is mapped to an interpolated point on the opposite curve. Concepts are mapped to 0 and 1 (absent and present), and laboratory values are continuous. Interpolation is linear between points and constant at the ends of the curves.
Figure 2
Figure 2
Correlation of laboratory values and signout note concepts. Four blood laboratory values (graphs A–D) are correlated with clinical concepts extracted from physician signout notes (respective graph legends). Linear correlation (y axis) is plotted against time lag in days (x axis). Signal to the left of 0 days implies that changes in the laboratory value preceded changes in the concept, and signal to the right implies that changes in the laboratory value followed changes in the concept. Correlation greater than zero implies that a higher laboratory value was associated with presence of the concept. See text for interpretations.
Figure 3
Figure 3
Patients with few values. The linear correlation between blood potassium level and mention of spironolactone is shown, comparing all patients with relevant data versus only patients with 10 or fewer laboratory values. Linear correlation (y axis) is plotted against time lag in days (x axis).
Figure 4
Figure 4
Comparison of different temporal algorithms. Linear correlation of blood potassium with mention of spironolactone (A) and blood sodium with mention of hyponatremia (B) is shown, comparing four temporal algorithms (see text). Linear correlation (y axis) is plotted against time lag in days (x axis).
Figure 5
Figure 5
Inducing low correlation. For the hyponatremia–sodium correlation, a lower correlation is induced by scrambling a proportion of the patients. When 70% of the patients have their values scrambled in time, the height of the correlation drops approximately 70%. When 100% of the patients have their values scrambled, the correlation disappears.

References

    1. Blumenthal D, Tavenner M. The “meaningful use” regulation for electronic health records. N Engl J Med 2010;363:501–4 - PubMed
    1. Friedman CP, Wong AK, Blumenthal D. Achieving a nationwide learning health system. Sci Transl Med 2010;2:57cm29 - PubMed
    1. Bellazzi R, Larizza C, Magni P, et al. Intelligent analysis of clinical time series: an application in the diabetes mellitus domain. Artif Intell Med 2000;20:37–57 - PubMed
    1. Sacchi L, Larizza C, Combi C, et al. Data mining with temporal abstractions: learning rules from time series. Data Min Knowl Discov 2007;15:217–47
    1. Allen JF. Maintaining knowledge about temporal intervals. Commun ACM 1983;26:832–43

Publication types