Prediction of [Formula: see text]-Thalassemia carriers using complete blood count features
- PMID: 36411295
- PMCID: PMC9678892
- DOI: 10.1038/s41598-022-22011-8
Prediction of [Formula: see text]-Thalassemia carriers using complete blood count features
Abstract
[Formula: see text]-Thalassemia is one of the dangerous causes of the high mortality rate in the Mediterranean countries. Substantial resources are required to save a [Formula: see text]-Thalassemia carriers' life and early detection of thalassemia patients can help appropriate treatment to increase the carrier's life expectancy. Being a genetic disease, it can not be prevented however the analysis of several indicators in parents' blood can be used to detect disorders causing Thalassemia. Laboratory tests for Thalassemia are time-consuming and expensive like high-performance liquid chromatography, Complete Blood Count (CBC) with peripheral smear, genetic test, etc. Red blood indices from CBC can be used with machine learning models for the same task. Despite the available approaches for Thalassemia carriers from CBC data, gaps exist between the desired and achieved accuracy. Moreover, the data imbalance problem is studied well which makes the models less generalizable. This study proposes a highly accurate approach for [Formula: see text]-Thalassemia detection using red blood indices from CBC augmented by supervised machine learning. In view of the fact that all the features do not carry predictive information regarding the target variable, this study employs a unified framework of two features selection techniques including Principal Component Analysis (PCA) and Singular Vector Decomposition (SVD). The data imbalance between [Formula: see text]-Thalassemia carrier and non-carriers is handled by Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic (ADASYN). Extensive experiments are performed using many state-of-the-art machine learning models and deep learning models. Experimental results indicate the superiority of the proposed approach over existing approaches with an accuracy score of 0.96.
© 2022. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures












Similar articles
-
Classification of α-thalassemia data using machine learning models.Comput Methods Programs Biomed. 2025 Mar;260:108581. doi: 10.1016/j.cmpb.2024.108581. Epub 2025 Jan 6. Comput Methods Programs Biomed. 2025. PMID: 39798280
-
Identifying β-thalassemia carriers using a data mining approach: The case of the Gaza Strip, Palestine.Artif Intell Med. 2018 Jun;88:70-83. doi: 10.1016/j.artmed.2018.04.009. Epub 2018 May 3. Artif Intell Med. 2018. PMID: 29730048
-
Assessing predictive performance of supervised machine learning algorithms for a diamond pricing model.Sci Rep. 2023 Oct 12;13(1):17315. doi: 10.1038/s41598-023-44326-w. Sci Rep. 2023. PMID: 37828360 Free PMC article.
-
Carrier screening for thalassemia and hemoglobinopathies in Canada.J Obstet Gynaecol Can. 2008 Oct;30(10):950-959. doi: 10.1016/S1701-2163(16)32975-9. J Obstet Gynaecol Can. 2008. PMID: 19038079 Review. English, French.
-
A review on design of scaffold for osteoinduction: Toward the unification of independent design variables.Biomech Model Mechanobiol. 2023 Feb;22(1):1-21. doi: 10.1007/s10237-022-01635-9. Epub 2022 Sep 19. Biomech Model Mechanobiol. 2023. PMID: 36121530 Review.
Cited by
-
Multidisciplinary approaches to study anaemia with special mention on aplastic anaemia (Review).Int J Mol Med. 2024 Nov;54(5):95. doi: 10.3892/ijmm.2024.5419. Epub 2024 Sep 2. Int J Mol Med. 2024. PMID: 39219286 Free PMC article. Review.
-
Predicting Thalassemia Using Feature Selection Techniques: A Comparative Analysis.Diagnostics (Basel). 2023 Nov 14;13(22):3441. doi: 10.3390/diagnostics13223441. Diagnostics (Basel). 2023. PMID: 37998577 Free PMC article.
-
Multiclass classification of thalassemia types using complete blood count and HPLC data with machine learning.Sci Rep. 2025 Jul 21;15(1):26379. doi: 10.1038/s41598-025-06594-6. Sci Rep. 2025. PMID: 40691682 Free PMC article.
References
-
- Arif F, Fayyaz J, Hamid A. Awareness among parents of children with thalassemia major. J. Pak. Med. Assoc. 2008;58:621–624. - PubMed
-
- Asif N, Hassan K. Management of thalassemia in Pakistan. J. Islamabad Med. Dent. Coll. 2016;5:152–153.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical