Understanding Cancer Risk Among Bangladeshi Women: An Explainable Machine Learning Approach to Socio-Reproductive Factors Using Tertiary Hospital Data
- PMID: 40565458
- PMCID: PMC12192815
- DOI: 10.3390/healthcare13121432
Understanding Cancer Risk Among Bangladeshi Women: An Explainable Machine Learning Approach to Socio-Reproductive Factors Using Tertiary Hospital Data
Abstract
Background: Breast cancer poses a significant health challenge in Bangladesh, where limited screening and unique reproductive patterns contribute to delayed diagnoses and subtype-specific disparities. While reproductive risk factors such as age at menarche, parity, and contraceptive use are well studied in high-income countries, their associations with hormone-receptor-positive (HR+) and triple-negative breast cancer (TNBC) remain underexplored in low-resource settings.
Methods: A case-control study was conducted at the National Institute of Cancer Research and Hospital (NICRH) including 486 histopathologically confirmed breast cancer cases (246 HR+, 240 TNBC) and 443 cancer-free controls. Socio-demographic and reproductive data were collected through structured interviews. Machine learning models-including Logistic Regression, Lasso, Support Vector Machines, Random Forest, and XGBoost-were trained using stratified five-fold cross-validation. Model performance was evaluated using sensitivity, F1-score, and Area Under Receiver Operating Curve (AUROC). To interpret model predictions and quantify the contribution of individual features, we employed Shapley Additive exPlanation (SHAP) values.
Results: XGBoost achieved the highest overall performance (F1-score = 0.750), and SHAP-based interpretability revealed key predictors for each subtype. Rural residence, low education (≤5 years), and undernutrition were significant predictors across subtypes. Cesarean delivery and multiple abortions were more predictive of TNBC, while urban residence, employment, and higher education were more predictive of HR+. Age at menarche and age at first childbirth showed decreasing predictive importance with increasing age for HR+, while larger gaps between marriage and childbirth were more predictive of TNBC.
Conclusions: Our findings underscore the value of machine learning coupled with SHAP-based explainability in identifying context-specific risk factors for breast cancer subtypes in resource-limited settings. This approach enhances transparency and supports the development of targeted public health interventions to reduce breast cancer disparities in Bangladesh.
Keywords: breast cancer risk; explainable machine learning; women reproductive risk factors.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures




Similar articles
-
Application of machine learning algorithms and SHAP explanations to predict fertility preference among reproductive women in Somalia.Sci Rep. 2025 Jul 20;15(1):26301. doi: 10.1038/s41598-025-04704-y. Sci Rep. 2025. PMID: 40685342 Free PMC article.
-
Exploring explainable machine learning algorithms to model predictors of tobacco use among men in Sub Sahara Africa between 2018 and 2023.Sci Rep. 2025 Jul 9;15(1):24646. doi: 10.1038/s41598-025-09380-6. Sci Rep. 2025. PMID: 40634414 Free PMC article.
-
Predicting Early-Onset Colorectal Cancer in Individuals Below Screening Age Using Machine Learning and Real-World Data: Case Control Study.JMIR Cancer. 2025 Jun 19;11:e64506. doi: 10.2196/64506. JMIR Cancer. 2025. PMID: 40537065 Free PMC article.
-
Maternal and neonatal outcomes of elective induction of labor.Evid Rep Technol Assess (Full Rep). 2009 Mar;(176):1-257. Evid Rep Technol Assess (Full Rep). 2009. PMID: 19408970 Free PMC article.
-
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340. Health Technol Assess. 2006. PMID: 16959170
References
-
- Urbanization in Bangladesh the Prevalence of Breast Cancer Brings Unique Challenges—The ASCO Post [Internet] [(accessed on 11 March 2024)]. Available online: https://ascopost.com/issues/october-25-2021/urbanization-in-bangladesh-t...
LinkOut - more resources
Full Text Sources