Enhancing the efficacy of depression detection system using optimal feature selection from EHR
- PMID: 36820618
- DOI: 10.1080/10255842.2023.2181660
Abstract
Diagnosing depression at an early stage is crucial, and it depends largely on the clinician's skill. The present work aims to develop an automated tool to assist the diagnosis of depression using multiple machine-learning techniques. The dataset used in this study has a sample size of 4184 and contains biometric and demographic information on individuals with and without depression; it was accessed from the University of Nice Sophia-Antipolis. Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF) and Extreme Gradient Boosting (XGBoost) classifiers are used to separate the depressed group from the control group. To improve computational efficiency, several feature selection algorithms are incorporated: Recursive Feature Elimination (RFE), Mutual Information (MI) and three bio-inspired techniques, viz. Particle Swarm Optimization (PSO), Genetic Algorithm (GA) and Firefly Algorithm (FA). To refine the feature selection further, majority voting is carried out over all possible combinations of three, four and five feature selection techniques. These feature selection techniques reduce the feature set from its original size of 61 to a mean of 33, a reduction of 45.90%. The classification accuracy of the enhanced models ranges from 84.18% to 88.46%, an improvement over the pre-existing models (83.76% to 85.89%). The proposed predictive models thus outperform the pre-existing classification models without feature selection, enhancing both the performance and the efficiency of the diagnostic process.
Keywords: Depression; firefly algorithm; genetic algorithm; machine learning; particle swarm optimization.
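The majority-voting step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the selector masks below are made-up toy data over six hypothetical features, the function names `majority_vote` and `vote_over_combinations` are invented for illustration, and the strict-majority rule (a feature is kept only if more than half of the selectors in a combination chose it) is an assumption, since the abstract does not say how ties are handled.

```python
from itertools import combinations


def majority_vote(masks):
    """Keep a feature if more than half of the selectors chose it (assumed rule)."""
    n_selectors = len(masks)
    n_features = len(masks[0])
    return [
        sum(mask[j] for mask in masks) * 2 > n_selectors
        for j in range(n_features)
    ]


def vote_over_combinations(selections, sizes=(3, 4, 5)):
    """Run majority voting for every combination of selectors of the given sizes."""
    results = {}
    for k in sizes:
        for names in combinations(sorted(selections), k):
            results[names] = majority_vote([selections[n] for n in names])
    return results


# Toy binary masks (1 = feature selected). Purely illustrative, not study data.
selections = {
    "RFE": [1, 1, 0, 1, 0, 0],
    "MI":  [1, 0, 0, 1, 1, 0],
    "PSO": [1, 1, 0, 0, 1, 0],
    "GA":  [0, 1, 0, 1, 1, 0],
    "FA":  [1, 1, 1, 0, 0, 0],
}

results = vote_over_combinations(selections)

# Five selectors give C(5,3) + C(5,4) + C(5,5) = 10 + 5 + 1 = 16 combinations.
n_combinations = len(results)

# Reduction reported in the abstract: from 61 features to a mean of 33.
reduction_pct = round((61 - 33) / 61 * 100, 2)
```

With five selectors the sketch evaluates 16 voting combinations, and the last line reproduces the reduction arithmetic from the abstract: (61 − 33) / 61 ≈ 45.90%.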