Int J Environ Res Public Health. 2018 Dec 19;15(12):2907.
doi: 10.3390/ijerph15122907.

Correlation Analysis to Identify the Effective Data in Machine Learning: Prediction of Depressive Disorder and Emotion States

Sunil Kumar et al. Int J Environ Res Public Health. 2018.

Abstract

Correlation analysis is a widely used technique for identifying relationships in data. These relationships indicate how relevant each attribute is to the target class to be predicted. This study uses correlation analysis and machine learning-based approaches to identify the attributes in a dataset that have a significant impact on classifying a patient's mental health status. The correlation analysis was performed in Weka on data covering depressive disorder symptoms and situations in relation to weather conditions, as well as emotion classification based on physiological sensor readings. Pearson's product-moment correlation was used together with several classification algorithms. The results show interesting correlations in weather attributes for bipolar patients, as well as in features extracted from physiological data for emotional states.

Keywords: correlation analysis; data analytics; health care; machine learning.
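
As a rough illustration of the attribute-ranking idea described in the abstract, the sketch below computes Pearson's product-moment correlation between each attribute and a numeric target, then sorts the attributes by absolute correlation. It is a minimal Python sketch, not the authors' Weka workflow: the function names (`pearson_r`, `rank_attributes`), the attribute names, and all data values are hypothetical and chosen only for demonstration.

```python
import math


def pearson_r(xs, ys):
    """Pearson's product-moment correlation between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    cov = sum(a * b for a, b in zip(dx, dy))
    var_x = sum(a * a for a in dx)
    var_y = sum(b * b for b in dy)
    if var_x == 0 or var_y == 0:
        # A constant column has no defined correlation; treat it as 0 here
        # (an assumption of this sketch) so it ranks last.
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def rank_attributes(columns, target):
    """Return (name, r) pairs sorted by |r|, strongest first."""
    scores = [(name, pearson_r(values, target)) for name, values in columns.items()]
    return sorted(scores, key=lambda item: abs(item[1]), reverse=True)


# Made-up toy data; these are not values from the study.
columns = {
    "temperature": [1.0, 2.0, 3.0, 4.0, 5.0],
    "humidity": [2.0, 1.0, 4.0, 3.0, 5.0],
    "wind_speed": [3.0, 3.0, 3.0, 3.0, 3.0],
}
target = [2.0, 4.0, 6.0, 8.0, 10.0]

ranking = rank_attributes(columns, target)
for name, r in ranking:
    print(f"{name}: r = {r:+.3f}")
```

Ranking by absolute value treats strong negative and strong positive correlations as equally informative. A classifier would then typically be trained on the top-ranked attributes only.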

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1. Correlation based on direction, form, and dispersion strength.
Figure 2. Methodology to identify strong predictor attributes.
Figure 3. Process flow of data analytics: (a) emotion detection; (b) identifying depressive disorder severity.
Figure 4. Scatter plot in Weka of top-ranked weather parameters for bipolar disorder.
Figure 5. Accuracies of prediction models with respect to stepwise feature selection.
