A comprehensive case study of deep learning on the detection of alpha thalassemia and beta thalassemia using public and private datasets
- PMID: 40246871
- PMCID: PMC12006322
- DOI: 10.1038/s41598-025-97353-0
A comprehensive case study of deep learning on the detection of alpha thalassemia and beta thalassemia using public and private datasets
Abstract
This study explores the performance of deep learning models, specifically Convolutional Neural Networks (CNN) and XGBoost, in predicting alpha and beta thalassemia using both public and private datasets. Thalassemia is a genetic disorder that impairs hemoglobin production, leading to anemia and other health complications. Early diagnosis is essential for effective management and prevention of severe health issues. The study applied CNN and XGBoost to two case studies: one for alpha-thalassemia and the other for beta-thalassemia. Public datasets were sourced from medical databases, while private datasets were collected from clinical records, offering a more comprehensive feature set and larger sample sizes. After data preprocessing and splitting, model performance was evaluated. XGBoost achieved 99.34% accuracy on the private dataset for alpha thalassemia, while CNN reached 98.10% accuracy on the private dataset for beta-thalassemia. The superior performance on private datasets was attributed to better data quality and volume. This study highlights the effectiveness of deep learning in medical diagnostics, demonstrating that high-quality data can significantly enhance the predictive capabilities of AI models. By integrating CNN and XGBoost, this approach offers a robust method for detecting thalassemia, potentially improving early diagnosis and reducing disease-related mortality.
Keywords: Alpha thalassemia; Beta thalassemia; CNN; Case study; Deep learning; Thalassemia; XGBoost.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests. Conflicts of Interest: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. Ethics approval and consent to participate: This study was approved by the Ethics committee of Riphah International University, Pakistan. Informed consent was obtained from all subjects. All the experimental procedures were performed by the standards of the Declaration of Helsinki 1964.
Figures





Similar articles
-
Exploring vision transformers and XGBoost as deep learning ensembles for transforming carcinoma recognition.Sci Rep. 2024 Dec 3;14(1):30052. doi: 10.1038/s41598-024-81456-1. Sci Rep. 2024. PMID: 39627293 Free PMC article.
-
Classification of α-thalassemia data using machine learning models.Comput Methods Programs Biomed. 2025 Mar;260:108581. doi: 10.1016/j.cmpb.2024.108581. Epub 2025 Jan 6. Comput Methods Programs Biomed. 2025. PMID: 39798280
-
Integrating deep learning and regression models for accurate prediction of groundwater fluoride contamination in old city in Bitlis province, Eastern Anatolia Region, Türkiye.Environ Sci Pollut Res Int. 2024 Jul;31(34):47201-47219. doi: 10.1007/s11356-024-34194-w. Epub 2024 Jul 11. Environ Sci Pollut Res Int. 2024. PMID: 38990257 Free PMC article.
-
Breast Cancer Detection Using Convolutional Neural Networks: A Deep Learning-Based Approach.Cureus. 2025 May 3;17(5):e83421. doi: 10.7759/cureus.83421. eCollection 2025 May. Cureus. 2025. PMID: 40322605 Free PMC article. Review.
-
Molecular study and prenatal diagnosis of alpha- and beta-thalassemias in Chinese.J Formos Med Assoc. 1998 Jan;97(1):5-15. J Formos Med Assoc. 1998. PMID: 9481058 Review.
Cited by
-
Multiclass classification of thalassemia types using complete blood count and HPLC data with machine learning.Sci Rep. 2025 Jul 21;15(1):26379. doi: 10.1038/s41598-025-06594-6. Sci Rep. 2025. PMID: 40691682 Free PMC article.
References
-
- Hirimutugoda, Y. M. & Wijayarathna, G. Image analysis system for detection of red cell disorders using artifcial neural networks. Sri Lanka J. Bio-Medical Inf.1 (1), 35 (2010).
-
- Zhuang, Q. et al. Te value of combined detection of HbA2 and HbF for the screening of thalassemia among individuals of childbearing ages. Zhonghua Yi Xue Yi Chuan Xue Za Zhi. 39 (1), 16–20 (2022). - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources