Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2021 Mar;73(3):442-448.
doi: 10.1002/acr.24132.

Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data

Affiliations
Comparative Study

Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data

Sara K Tedeschi et al. Arthritis Care Res (Hoboken). 2021 Mar.

Abstract

Objective: Identifying pseudogout in large data sets is difficult due to its episodic nature and a lack of billing codes specific to this acute subtype of calcium pyrophosphate (CPP) deposition disease. The objective of this study was to evaluate a novel machine learning approach for classifying pseudogout using electronic health record (EHR) data.

Methods: We created an EHR data mart of patients with ≥1 relevant billing code or ≥2 natural language processing (NLP) mentions of pseudogout or chondrocalcinosis, 1991-2017. We selected 900 subjects for gold standard chart review for definite pseudogout (synovitis + synovial fluid CPP crystals), probable pseudogout (synovitis + chondrocalcinosis), or not pseudogout. We applied a topic modeling approach to identify definite/probable pseudogout. A combined algorithm included topic modeling plus manually reviewed CPP crystal results. We compared algorithm performance and cohorts identified by billing codes, the presence of CPP crystals, topic modeling, and a combined algorithm.

Results: Among 900 subjects, 123 (13.7%) had pseudogout by chart review (68 definite, 55 probable). Billing codes had a sensitivity of 65% and a positive predictive value (PPV) of 22% for pseudogout. The presence of CPP crystals had a sensitivity of 29% and a PPV of 92%. Without using CPP crystal results, topic modeling had a sensitivity of 29% and a PPV of 79%. The combined algorithm yielded a sensitivity of 42% and a PPV of 81%. The combined algorithm identified 50% more patients than the presence of CPP crystals; the latter captured a portion of definite pseudogout and missed probable pseudogout.

Conclusion: For pseudogout, an episodic disease with no specific billing code, combining NLP, machine learning methods, and synovial fluid laboratory results yielded an algorithm that significantly boosted the PPV compared to billing codes.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Novel machine learning approach to classifying definite/probable pseudogout in an electronic health record dataset.

Similar articles

Cited by

References

    1. Zhang W, Doherty M, Bardin T, et al. European League Against Rheumatism recommendations for calcium pyrophosphate deposition. Part I: terminology and diagnosis. Annals of the rheumatic diseases 2011;70:563–70. - PubMed
    1. Abhishek A, Neogi T, Choi H, Doherty M, Rosenthal AK, Terkeltaub R. Review: Unmet Needs and the Path Forward in Joint Disease Associated With Calcium Pyrophosphate Crystal Deposition. Arthritis & rheumatology 2018;70:1182–91. - PMC - PubMed
    1. Tedeschi SK, Solomon DH, Liao KP. Pseudogout among Patients Fulfilling a Billing Code Algorithm for Calcium Pyrophosphate Deposition Disease. Rheumatology international 2018;38:1083–8. - PMC - PubMed
    1. Zhang Y, Cai T, Yu S, et al. Methods for high-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP). Nature Protocols 2019. - PMC - PubMed
    1. Liu L, Tang L, Dong W, Yao S, Zhou W. An overview of topic modeling and its current applications in bioinformatics. Springerplus 2016;5:1608. - PMC - PubMed

Publication types

LinkOut - more resources