Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan;11(1):172-181.
doi: 10.1055/s-0040-1702214. Epub 2020 Mar 4.

Detecting Social and Behavioral Determinants of Health with Structured and Free-Text Clinical Data

Affiliations

Detecting Social and Behavioral Determinants of Health with Structured and Free-Text Clinical Data

Daniel J Feller et al. Appl Clin Inform. 2020 Jan.

Abstract

Background: Social and behavioral determinants of health (SBDH) are environmental and behavioral factors that often impede disease management and result in sexually transmitted infections. Despite their importance, SBDH are inconsistently documented in electronic health records (EHRs) and typically collected only in an unstructured format. Evidence suggests that structured data elements present in EHRs can contribute further to identify SBDH in the patient record.

Objective: Explore the automated inference of both the presence of SBDH documentation and individual SBDH risk factors in patient records. Compare the relative ability of clinical notes and structured EHR data, such as laboratory measurements and diagnoses, to support inference.

Methods: We attempt to infer the presence of SBDH documentation in patient records, as well as patient status of 11 SBDH, including alcohol abuse, homelessness, and sexual orientation. We compare classification performance when considering clinical notes only, structured data only, and notes and structured data together. We perform an error analysis across several SBDH risk factors.

Results: Classification models inferring the presence of SBDH documentation achieved good performance (F1 score: 92.7-78.7; F1 considered as the primary evaluation metric). Performance was variable for models inferring patient SBDH risk status; results ranged from F1 = 82.7 for LGBT (lesbian, gay, bisexual, and transgender) status to F1 = 28.5 for intravenous drug use. Error analysis demonstrated that lexical diversity and documentation of historical SBDH status challenge inference of patient SBDH status. Three of five classifiers inferring topic-specific SBDH documentation and 10 of 11 patient SBDH status classifiers achieved highest performance when trained using both clinical notes and structured data.

Conclusion: Our findings suggest that combining clinical free-text notes and structured data provide the best approach in classifying patient SBDH status. Inferring patient SBDH status is most challenging among SBDH with low prevalence and high lexical diversity.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Fig. 1
Fig. 1
Prediction Task for SBDH labels. SBDH, social and behavioral determinants of health.
Fig. 2
Fig. 2
Overview of methods for machine learning. TF-IDF, term frequency–inverse document frequency; SVC, support vector classifier.
Fig. 3
Fig. 3
Relationship between SBDH prevalence and classifier performance. LGBT, lesbian, gay, bisexual, transgender; SBDH, social and behavioral determinants of health; STI, sexually transmitted disease.

References

    1. Cohen S M, Hu X, Sweeney P, Johnson A S, Hall H I. HIV viral suppression among persons with varying levels of engagement in HIV medical care, 19 US jurisdictions. J Acquir Immune Defic Syndr. 2014;67(05):519–527. - PMC - PubMed
    1. Facente S N, Pilcher C D, Hartogensis W E et al.Performance of risk-based criteria for targeting acute HIV screening in San Francisco. PLoS One. 2011;6(07):e21813–e21813. - PMC - PubMed
    1. Haukoos J S, Hopkins E, Bender B, Sasson C, Al-Tayyib A A, Thrun M W; Denver Emergency Department HIV Testing Research Consortium.Comparison of enhanced targeted rapid HIV screening using the Denver HIV risk score to nontargeted rapid HIV screening in the emergency department Ann Emerg Med 20136103353–361. - PMC - PubMed
    1. Lauby J, Zhu L, Milnamow M et al.Get real: evaluation of a community-level HIV Prevention intervention for young MSM who engage in episodic substance use. AIDS Educ Prev. 2017;29(03):191–204. - PubMed
    1. Gottlieb L M, Tirozzi K J, Manchanda R, Burns A R, Sandel M T. Moving electronic medical records upstream: incorporating social determinants of health. Am J Prev Med. 2015;48(02):215–218. - PubMed

Publication types