Comparative Study
J Vasc Surg. 2017 Jun;65(6):1753-1761.
doi: 10.1016/j.jvs.2016.11.031. Epub 2017 Feb 8.

Mining peripheral arterial disease cases from narrative clinical notes using natural language processing

Naveed Afzal et al. J Vasc Surg. 2017 Jun.

Abstract

Objective: Lower extremity peripheral arterial disease (PAD) is highly prevalent and affects millions of individuals worldwide. We developed a natural language processing (NLP) system for automated ascertainment of PAD cases from clinical narrative notes and compared the performance of the NLP algorithm with billing code algorithms, using ankle-brachial index test results as the gold standard.

Methods: We compared the performance of the NLP algorithm to (1) results of gold standard ankle-brachial index; (2) previously validated algorithms based on relevant International Classification of Diseases, Ninth Revision diagnostic codes (simple model); and (3) a combination of International Classification of Diseases, Ninth Revision codes with procedural codes (full model). A dataset of 1569 patients with PAD and controls was randomly divided into training (n = 935) and testing (n = 634) subsets.
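The random division into training and testing subsets can be illustrated with a minimal sketch. The abstract gives only the subset sizes (935 and 634); the function name `split_patients`, the integer patient identifiers, and the fixed seed are assumptions made for illustration, not details of the authors' procedure.

```python
import random


def split_patients(patient_ids, n_train, seed=0):
    """Randomly split patient IDs into training and testing subsets.

    Hypothetical helper: the seed and the ID scheme are illustrative
    assumptions, not taken from the study.
    """
    ids = list(patient_ids)
    rng = random.Random(seed)  # local generator so the split is reproducible
    rng.shuffle(ids)
    return ids[:n_train], ids[n_train:]


# Placeholder IDs 0..1568 stand in for the 1569 patients in the dataset.
train_ids, test_ids = split_patients(range(1569), n_train=935)
print(len(train_ids), len(test_ids))  # 935 634
```

Splitting at the patient level, rather than the note level, keeps all notes from one patient in the same subset.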

Results: We iteratively refined the NLP algorithm in the training set including narrative note sections, note types, and service types, to maximize its accuracy. In the testing dataset, when compared with both simple and full models, the NLP algorithm had better accuracy (NLP, 91.8%; full model, 81.8%; simple model, 83%; P < .001), positive predictive value (NLP, 92.9%; full model, 74.3%; simple model, 79.9%; P < .001), and specificity (NLP, 92.5%; full model, 64.2%; simple model, 75.9%; P < .001).
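The reported measures (accuracy, positive predictive value, and specificity) all derive from a comparison of predicted PAD status with the gold standard. The sketch below shows the standard definitions; the function name `classification_metrics` and the toy labels are hypothetical and do not reproduce the study data.

```python
def classification_metrics(predicted, gold):
    """Compute accuracy, PPV, specificity, and sensitivity.

    `predicted` and `gold` are equal-length sequences of 0/1 labels,
    where 1 means PAD case and 0 means control.
    """
    tp = fp = tn = fn = 0
    for p, g in zip(predicted, gold):
        if p == 1 and g == 1:
            tp += 1
        elif p == 1 and g == 0:
            fp += 1
        elif p == 0 and g == 0:
            tn += 1
        else:
            fn += 1

    def ratio(numerator, denominator):
        # Undefined ratios (empty denominator) are returned as NaN.
        return numerator / denominator if denominator else float("nan")

    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "ppv": ratio(tp, tp + fp),          # positive predictive value
        "specificity": ratio(tn, tn + fp),  # true negative rate
        "sensitivity": ratio(tp, tp + fn),  # true positive rate
    }


# Toy labels only: 1 = PAD by the gold standard, 0 = control.
gold = [1, 1, 1, 0, 0, 0, 0, 1]
predicted = [1, 1, 0, 0, 0, 1, 0, 1]
metrics = classification_metrics(predicted, gold)
print(metrics["accuracy"], metrics["ppv"], metrics["specificity"])  # 0.75 0.75 0.75
```

Applying the same function to the output of each algorithm (NLP, simple model, full model) on the testing subset would give directly comparable figures.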

Conclusions: A knowledge-driven NLP algorithm for automatic ascertainment of PAD cases from clinical notes had greater accuracy than billing code algorithms. Our findings highlight the potential of NLP tools for rapid and efficient ascertainment of PAD cases from electronic health records to facilitate clinical investigation and eventually improve care by clinical decision support.


Figures

Figure 1. Dataset Description
Figure 2. Study Design
Figure 3. PAD Concept Visualization
Figure 4. Accuracy of NLP algorithm compared with billing code algorithms (simple model and full model) for ascertainment of PAD status
