Breast cancer recurrence prediction with ensemble methods and cost-sensitive learning
- PMID: 34027105
- PMCID: PMC8122465
- DOI: 10.1515/med-2021-0282
Breast cancer recurrence prediction with ensemble methods and cost-sensitive learning
Abstract
Breast cancer is one of the most common cancers in women all over the world. Due to the improvement of medical treatments, most of the breast cancer patients would be in remission. However, the patients have to face the next challenge, the recurrence of breast cancer which may cause more severe effects, and even death. The prediction of breast cancer recurrence is crucial for reducing mortality. This paper proposes a prediction model for the recurrence of breast cancer based on clinical nominal and numeric features. In this study, our data consist of 1,061 patients from Breast Cancer Registry from Shin Kong Wu Ho-Su Memorial Hospital between 2011 and 2016, in which 37 records are denoted as breast cancer recurrence. Each record has 85 features. Our approach consists of three stages. First, we perform data preprocessing and feature selection techniques to consolidate the dataset. Among all features, six features are identified for further processing in the following stages. Next, we apply resampling techniques to resolve the issue of class imbalance. Finally, we construct two classifiers, AdaBoost and cost-sensitive learning, to predict the risk of recurrence and carry out the performance evaluation in three-fold cross-validation. By applying the AdaBoost method, we achieve accuracy of 0.973 and sensitivity of 0.675. By combining the AdaBoost and cost-sensitive method of our model, we achieve a reasonable accuracy of 0.468 and substantially high sensitivity of 0.947 which guarantee almost no false dismissal. Our model can be used as a supporting tool in the setting and evaluation of the follow-up visit for early intervention and more advanced treatments to lower cancer mortality.
Keywords: AdaBoost; classification; cost-sensitive method; machine learning; recurrent breast cancer.
© 2021 Pei-Tse Yang et al., published by De Gruyter.
Conflict of interest statement
Conflict of interest: The authors have no conflicts of interest to declare.
Figures
Similar articles
-
Reviewing ensemble classification methods in breast cancer.Comput Methods Programs Biomed. 2019 Aug;177:89-112. doi: 10.1016/j.cmpb.2019.05.019. Epub 2019 May 20. Comput Methods Programs Biomed. 2019. PMID: 31319964 Review.
-
Machine learning models in breast cancer survival prediction.Technol Health Care. 2016;24(1):31-42. doi: 10.3233/THC-151071. Technol Health Care. 2016. PMID: 26409558
-
Homogeneous Adaboost Ensemble Machine Learning Algorithms with Reduced Entropy on Balanced Data.Entropy (Basel). 2023 Jan 29;25(2):245. doi: 10.3390/e25020245. Entropy (Basel). 2023. PMID: 36832611 Free PMC article.
-
Full Intelligent Cancer Classification of Thermal Breast Images to Assist Physician in Clinical Diagnostic Applications.J Med Signals Sens. 2016 Jan-Mar;6(1):12-24. J Med Signals Sens. 2016. PMID: 27014608 Free PMC article.
-
Overview of resistance to systemic therapy in patients with breast cancer.Adv Exp Med Biol. 2007;608:1-22. doi: 10.1007/978-0-387-74039-3_1. Adv Exp Med Biol. 2007. PMID: 17993229 Review.
Cited by
-
Cancer Stem Cells (CSCs), Circulating Tumor Cells (CTCs) and Their Interplay with Cancer Associated Fibroblasts (CAFs): A New World of Targets and Treatments.Cancers (Basel). 2022 May 13;14(10):2408. doi: 10.3390/cancers14102408. Cancers (Basel). 2022. PMID: 35626011 Free PMC article. Review.
-
Machine learning-based models for the prediction of breast cancer recurrence risk.BMC Med Inform Decis Mak. 2023 Nov 29;23(1):276. doi: 10.1186/s12911-023-02377-z. BMC Med Inform Decis Mak. 2023. PMID: 38031071 Free PMC article.
-
Application of machine learning algorithms in predicting HIV infection among men who have sex with men: Model development and validation.Front Public Health. 2022 Aug 25;10:967681. doi: 10.3389/fpubh.2022.967681. eCollection 2022. Front Public Health. 2022. PMID: 36091522 Free PMC article.
-
Application of machine learning in breast cancer survival prediction using a multimethod approach.Sci Rep. 2024 Dec 3;14(1):30147. doi: 10.1038/s41598-024-81734-y. Sci Rep. 2024. PMID: 39627494 Free PMC article.
-
Predictive model and risk analysis for coronary heart disease in people living with HIV using machine learning.BMC Med Inform Decis Mak. 2024 Apr 25;24(1):110. doi: 10.1186/s12911-024-02511-5. BMC Med Inform Decis Mak. 2024. PMID: 38664736 Free PMC article.
References
-
- World Health Organization. WHO position paper on mammography screening [Internet]. Switzerland: World Health Organization; 2014. Available From: https://apps.who.int/iris/handle/10665/137339
- World Health Organization. WHO position paper on mammography screening [Internet] Switzerland: World Health Organization; 2014. Available From: https://apps.who.int/iris/handle/10665/137339.
-
- American Cancer Society. Cancer facts & figures 2020 [Internet]. Atlanta: American Cancer Society; 2020. Available From: https://www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-...
- American Cancer Society. Cancer facts & figures 2020 [Internet] Atlanta: American Cancer Society; 2020. Available From: https://www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-....
-
- Kim J, Shin H. Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data. J Am Med Inf Assoc. 2013;20(4):613–8. 10.1136/amiajnl-2012-001570. PubMed PMID: 23467471; PubMed Central PMCID: PMC3721173. - DOI - PMC - PubMed
- Kim J, Shin H. Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data. J Am Med Inf Assoc. 2013;20(4):613–8. doi: 10.1136/amiajnl-2012-001570. PubMed PMID: 23467471; PubMed Central PMCID: PMC3721173. - DOI - PMC - PubMed
-
- Hsu JL, Hung PC, Lin HY, Hsieh CH. Applying under-sampling techniques and cost-sensitive learning methods on risk assessment of breast cancer. J Med Syst. 2015 Apr;39(4):1–3. 10.1007/s10916-015-0210-x. PubMed PMID: 25712814. - DOI - PubMed
- Hsu JL, Hung PC, Lin HY, Hsieh CH. Applying under-sampling techniques and cost-sensitive learning methods on risk assessment of breast cancer. J Med Syst. 2015 Apr;39(4):1–3. doi: 10.1007/s10916-015-0210-x. PubMed PMID: 25712814. - DOI - PubMed
-
- Seely JM, Alhassan T. Screening for breast cancer in 2018-what should we be doing today? Curr Oncol. 2018 Jun;25(Suppl 1):S115–24. 10.374/co.25.3770. PubMed PMID:29910654; PubMed Central PMCID: PMC6001765. - DOI - PMC - PubMed
- Seely JM, Alhassan T.. Screening for breast cancer in 2018-what should we be doing today? Curr Oncol. 2018 Jun;25(Suppl 1):S115–24. doi: 10.374/co.25.3770. PubMed PMID:29910654; PubMed Central PMCID: PMC6001765. - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources
Other Literature Sources