Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Nov;8(2):e001725.
doi: 10.1136/bmjdrc-2020-001725.

Multimethod, multidataset analysis reveals paradoxical relationships between sociodemographic factors, Hispanic ethnicity and diabetes

Affiliations

Multimethod, multidataset analysis reveals paradoxical relationships between sociodemographic factors, Hispanic ethnicity and diabetes

Gabriel M Knight et al. BMJ Open Diabetes Res Care. 2020 Nov.

Abstract

Introduction: Population-level and individual-level analyses have strengths and limitations as do 'blackbox' machine learning (ML) and traditional, interpretable models. Diabetes mellitus (DM) is a leading cause of morbidity and mortality with complex sociodemographic dynamics that have not been analyzed in a way that leverages population-level and individual-level data as well as traditional epidemiological and ML models. We analyzed complementary individual-level and county-level datasets with both regression and ML methods to study the association between sociodemographic factors and DM.

Research design and methods: County-level DM prevalence, demographics, and socioeconomic status (SES) factors were extracted from the 2018 Robert Wood Johnson Foundation County Health Rankings and merged with US Census data. Analogous individual-level data were extracted from 2007 to 2016 National Health and Nutrition Examination Survey studies and corrected for oversampling with survey weights. We used multivariate linear (logistic) regression and ML regression (classification) models for county (individual) data. Regression and ML models were compared using measures of explained variation (area under the receiver operating characteristic curve (AUC) and R2).

Results: Among the 3138 counties assessed, the mean DM prevalence was 11.4% (range: 3.0%-21.1%). Among the 12 824 individuals assessed, 1688 met DM criteria (13.2% unweighted; 10.2% weighted). Age, gender, race/ethnicity, income, and education were associated with DM at the county and individual levels. Higher county Hispanic ethnic density was negatively associated with county DM prevalence, while Hispanic ethnicity was positively associated with individual DM. ML outperformed regression in both datasets (mean R2 of 0.679 vs 0.610, respectively (p<0.001) for county-level data; mean AUC of 0.737 vs 0.727 (p<0.0427) for individual-level data).

Conclusions: Hispanic individuals are at higher risk of DM, while counties with larger Hispanic populations have lower DM prevalence. Analyses of population-level and individual-level data with multiple methods may afford more confidence in results and identify areas for further study.

Keywords: diabetes mellitus; ethnic groups; informatics; risk factors; type 2.

PubMed Disclaimer

Conflict of interest statement

Competing interests: DMM acts as an advisor for Sanofi-Aventis US LLC, Novo Nordisk Inc, WL Gore and Associates, Inc, and Medtronic Minimed, Inc. DS acts as an advisor for Carta Healthcare. No other potential conflicts of interest relevant to this article were reported.

Figures

Figure 1
Figure 1
Map of DM prevalence by county. Map of US counties according to county-level DM prevalence rates obtained from CHR data. CHR, County Health Rankings; DM, diabetes mellitus.

References

    1. American Diabetes Association Economic costs of diabetes in the U.S. in 2017. Diabetes Care 2018;41:917–28. 10.2337/dci18-0007 - DOI - PMC - PubMed
    1. Maffi P, Secchi A. The burden of diabetes: emerging data. Dev Ophthalmol 2017;60:1–5. 10.1159/000459641 - DOI - PubMed
    1. Dieleman JL, Cao J, Chapin A, et al. . Us health care spending by payer and health condition, 1996–2016. JAMA 2020;323:863–84. 10.1001/jama.2020.0734 - DOI - PMC - PubMed
    1. Zheng Y, Ley SH, Hu FB. Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat Rev Endocrinol 2018;14:88–98. 10.1038/nrendo.2017.151 - DOI - PubMed
    1. Fletcher B, Gulanick M, Lamendola C. Risk factors for type 2 diabetes mellitus. J Cardiovasc Nurs 2002;16:17–23. 10.1097/00005082-200201000-00003 - DOI - PubMed

Publication types