Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning-driven data analysis
- PMID: 38206587
- PMCID: PMC10783149
- DOI: 10.1093/gigascience/giad111
Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning-driven data analysis
Abstract
Background: Machine learning (ML) has emerged as a vital asset for researchers to analyze and extract valuable information from complex datasets. However, developing an effective and robust ML pipeline can present a real challenge, demanding considerable time and effort, thereby impeding research progress. Existing tools in this landscape require a profound understanding of ML principles and programming skills. Furthermore, users are required to engage in the comprehensive configuration of their ML pipeline to obtain optimal performance.
Results: To address these challenges, we have developed a novel tool called Machine Learning Made Easy (MLme) that streamlines the use of ML in research, specifically focusing on classification problems at present. By integrating 4 essential functionalities-namely, Data Exploration, AutoML, CustomML, and Visualization-MLme fulfills the diverse requirements of researchers while eliminating the need for extensive coding efforts. To demonstrate the applicability of MLme, we conducted rigorous testing on 6 distinct datasets, each presenting unique characteristics and challenges. Our results consistently showed promising performance across different datasets, reaffirming the versatility and effectiveness of the tool. Additionally, by utilizing MLme's feature selection functionality, we successfully identified significant markers for CD8+ naive (BACH2), CD16+ (CD16), and CD14+ (VCAN) cell populations.
Conclusion: MLme serves as a valuable resource for leveraging ML to facilitate insightful data analysis and enhance research outcomes, while alleviating concerns related to complex coding scripts. The source code and a detailed tutorial for MLme are available at https://github.com/FunctionalUrology/MLme.
Keywords: AutoML; classification problems; data analysis; machine learning; visualization.
© The Author(s) 2024. Published by Oxford University Press GigaScience.
Conflict of interest statement
The authors declare they have no competing interests.
Figures



Update of
-
Machine Learning Made Easy (MLme): A Comprehensive Toolkit for Machine Learning-Driven Data Analysis.bioRxiv [Preprint]. 2023 Jul 4:2023.07.04.546825. doi: 10.1101/2023.07.04.546825. bioRxiv. 2023. Update in: Gigascience. 2024 Jan 2;13:giad111. doi: 10.1093/gigascience/giad111. PMID: 37461685 Free PMC article. Updated. Preprint.
Similar articles
-
Machine Learning Made Easy (MLme): A Comprehensive Toolkit for Machine Learning-Driven Data Analysis.bioRxiv [Preprint]. 2023 Jul 4:2023.07.04.546825. doi: 10.1101/2023.07.04.546825. bioRxiv. 2023. Update in: Gigascience. 2024 Jan 2;13:giad111. doi: 10.1093/gigascience/giad111. PMID: 37461685 Free PMC article. Updated. Preprint.
-
ShinyLearner: A containerized benchmarking tool for machine-learning classification of tabular data.Gigascience. 2020 Apr 1;9(4):giaa026. doi: 10.1093/gigascience/giaa026. Gigascience. 2020. PMID: 32249316 Free PMC article.
-
iProps: A Comprehensive Software Tool for Protein Classification and Analysis With Automatic Machine Learning Capabilities and Model Interpretation Capabilities.IEEE J Biomed Health Inform. 2024 Oct;28(10):6237-6247. doi: 10.1109/JBHI.2024.3425716. Epub 2024 Oct 3. IEEE J Biomed Health Inform. 2024. PMID: 39008396
-
Automated machine learning: Review of the state-of-the-art and opportunities for healthcare.Artif Intell Med. 2020 Apr;104:101822. doi: 10.1016/j.artmed.2020.101822. Epub 2020 Feb 21. Artif Intell Med. 2020. PMID: 32499001 Review.
-
Automated machine learning with R: AutoML tools for beginners in clinical research.J Minim Invasive Surg. 2024 Sep 15;27(3):129-137. doi: 10.7602/jmis.2024.27.3.129. J Minim Invasive Surg. 2024. PMID: 39300720 Free PMC article. Review.
Cited by
-
Machine Learning Made Easy (MLme): A Comprehensive Toolkit for Machine Learning-Driven Data Analysis.bioRxiv [Preprint]. 2023 Jul 4:2023.07.04.546825. doi: 10.1101/2023.07.04.546825. bioRxiv. 2023. Update in: Gigascience. 2024 Jan 2;13:giad111. doi: 10.1093/gigascience/giad111. PMID: 37461685 Free PMC article. Updated. Preprint.
-
MLcps: machine learning cumulative performance score for classification problems.Gigascience. 2022 Dec 28;12:giad108. doi: 10.1093/gigascience/giad108. Epub 2023 Dec 13. Gigascience. 2022. PMID: 38091508 Free PMC article.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials