Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Dec 9;11(12):e0167055.
doi: 10.1371/journal.pone.0167055. eCollection 2016.

Into the Bowels of Depression: Unravelling Medical Symptoms Associated with Depression by Applying Machine-Learning Techniques to a Community Based Population Sample

Affiliations

Into the Bowels of Depression: Unravelling Medical Symptoms Associated with Depression by Applying Machine-Learning Techniques to a Community Based Population Sample

Joanna F Dipnall et al. PLoS One. .

Abstract

Background: Depression is commonly comorbid with many other somatic diseases and symptoms. Identification of individuals in clusters with comorbid symptoms may reveal new pathophysiological mechanisms and treatment targets. The aim of this research was to combine machine-learning (ML) algorithms with traditional regression techniques by utilising self-reported medical symptoms to identify and describe clusters of individuals with increased rates of depression from a large cross-sectional community based population epidemiological study.

Methods: A multi-staged methodology utilising ML and traditional statistical techniques was performed using the community based population National Health and Nutrition Examination Study (2009-2010) (N = 3,922). A Self-organised Mapping (SOM) ML algorithm, combined with hierarchical clustering, was performed to create participant clusters based on 68 medical symptoms. Binary logistic regression, controlling for sociodemographic confounders, was used to then identify the key clusters of participants with higher levels of depression (PHQ-9≥10, n = 377). Finally, a Multiple Additive Regression Tree boosted ML algorithm was run to identify the important medical symptoms for each key cluster within 17 broad categories: heart, liver, thyroid, respiratory, diabetes, arthritis, fractures and osteoporosis, skeletal pain, blood pressure, blood transfusion, cholesterol, vision, hearing, psoriasis, weight, bowels and urinary.

Results: Five clusters of participants, based on medical symptoms, were identified to have significantly increased rates of depression compared to the cluster with the lowest rate: odds ratios ranged from 2.24 (95% CI 1.56, 3.24) to 6.33 (95% CI 1.67, 24.02). The ML boosted regression algorithm identified three key medical condition categories as being significantly more common in these clusters: bowel, pain and urinary symptoms. Bowel-related symptoms was found to dominate the relative importance of symptoms within the five key clusters.

Conclusion: This methodology shows promise for the identification of conditions in general populations and supports the current focus on the potential importance of bowel symptoms and the gut in mental health research.

PubMed Disclaimer

Conflict of interest statement

JFD has no conflicts of interest in relation to this manuscript. JAP has recently received grant/research support from the National Health and Medical Research Council (NHMRC), BUPA Foundation, Amgen,/GlaxoSmithKline/Osteoporosis Australia/Australian and New Zealand Bone and Mineral Society, Western Alliance, Barwon Health, Deakin University and the Geelong Community Foundation. MB has received Grant/Research Support from the NIH, Cooperative Research Centre, Simons Autism Foundation, Cancer Council of Victoria, Stanley Medical Research Foundation, MBF, NHMRC, Beyond Blue, Rotary Health, Geelong Medical Research Foundation, Bristol Myers Squibb, Eli Lilly, Glaxo SmithKline, Meat and Livestock Board, Organon, Novartis, Mayne Pharma, Servier and Woolworths, has been a speaker for Astra Zeneca, Bristol Myers Squibb, Eli Lilly, Glaxo SmithKline, Janssen Cilag, Lundbeck, Merck, Pfizer, Sanofi Synthelabo, Servier, Solvay and Wyeth, and served as a consultant to Astra Zeneca, Bioadvantex, Bristol Myers Squibb, Eli Lilly, Glaxo SmithKline, Janssen Cilag, Lundbeck Merck and Servier. Drs Copolov, MB and Bush are co-inventors of provisional patent 02799377.3-2107-AU02 “Modulation of physiological process and agents useful for same”. MB and Laupu are co-authors of provisional patent 2014900627 “Modulation of diseases of the central nervous system and related disorders”. MB is supported by a NHMRC Senior Principal Research Fellowship 1059660. LJW is supported by a NHMRC Career Development Fellowship 1064272. SD has received grants/research support from the Stanley Medical Research Institute, NHMRC, Beyond Blue, ARHRF, Simons Foundation, Geelong Medical Research Foundation, Fondation FondaMental, Eli Lilly, Glaxo SmithKline, Organon, Mayne Pharma and Servier, speaker’s fees from Eli Lilly, advisory board fees from Eli Lilly and Novartis, and conference travel support from Servier. FNJ has received Grant/Research support from the Brain and Behaviour Research Institute, the National Health and Medical Research Council (NHMRC), Australian Rotary Health, the Geelong Medical Research Foundation, the Ian Potter Foundation, Eli Lilly, the Meat and Livestock Board and The University of Melbourne and has received speakers honoraria from Sanofi-Synthelabo, Janssen Cilag, Servier, Pfizer, Health Ed, Network Nutrition, Angelini Farmaceutica, and Eli Lilly. She is supported by an NHMRC Career Development Fellowship (#1108125). DM has received grant/research support from the Australian research Council (ARC), Mental Illness Research Fund (MIRF), Victorian Department of Justice, Beyond Blue, Swinburne University of Technology, Federal University. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

Figures

Fig 1
Fig 1. Flowchart of Methods, Testing and Results.
Fig 2
Fig 2. Training progress and SOM plots.
Note: The “Training progress” graph indicates as the SOM training iterations distance from each node's weights to the samples represented by that node reduces and plateaus to indicate no more iterations were required. The “Counts plots” indicates reasonable samples were mapped to each node on the map. The “Neighbour distance plot” or U-Matrix indicates the distance between each node and its neighbours.
Fig 3
Fig 3. Hierarchical Cluster Options for SOM.
Note: Clusters 3 to 12 solutions mapped onto the SOM grid. Colours indicate different clusters. The final 10 cluster solution selected for further analysis has been highlighted with a red border.
Fig 4
Fig 4. Mean depression scores and percent depression across final depression clusters.
Note: “Mean Depression Score” is the average total PHQ-9 score which ranged from 0 to 27. “Percent Depressed” based on a total PHQ-9 ≥ 10.
Fig 5
Fig 5. Predicted probability of depression across age and family income poverty ratio for each cluster.
Fig 6
Fig 6. Importance of medical categories that make up the key significant clusters.
Note: Based on total boosted relative importance percentage across all clusters. Summed percentage from boosted regression across all five key significant clusters, thus total >100%.
Fig 7
Fig 7. Total percentage importance of medical conditions for each key significant cluster.
Note: Clusters presented in order of percent depressed. Note: Percentage sum does not take account of direction of relationship.

References

    1. Sanna L, Stuart AL, Pasco JA, Kotowicz MA, Berk M, et al. (2013) Physical comorbidities in men with mood and anxiety disorders: a population-based study. BMC Med 11: 1. - PMC - PubMed
    1. Sanna L, Stuart AL, Pasco JA, Jacka FN, Berk M, et al. (2014) Atopic disorders and depression: findings from a large, population-based study. J Affect Disord 155: 261–265. 10.1016/j.jad.2013.11.009 - DOI - PubMed
    1. Fond G, Loundou A, Hamdani N, Boukouaci W, Dargel A, et al. (2014) Anxiety and depression comorbidities in irritable bowel syndrome (IBS): a systematic review and meta-analysis. Eur Arch Psychiatry Clin Neurosci 264: 651–660. 10.1007/s00406-014-0502-z - DOI - PubMed
    1. Kronish IM, Carson AP, Davidson KW, Muntner P, Safford MM (2012) Depressive symptoms and cardiovascular health by the american heart association’s definition in the reasons for geographic and racial differences in stroke (REGARDS) study. PLoS One 7: e52771 10.1371/journal.pone.0052771 - DOI - PMC - PubMed
    1. Massie MJ (2004) Prevalence of depression in patients with cancer. Monographs-National Cancer Institute 32: 57–71. - PubMed

MeSH terms