Data-driven treatment pathways mining for early breast cancer using cSPADE algorithm and system clustering
- PMID: 35445441
- DOI: 10.1002/hpm.3483
Abstract
Objectives: The multidimensional, multilayered, and chronologically ordered nature of cancer data makes treatment pathways difficult to extract. This study aimed to determine whether the cSPADE algorithm combined with system clustering can effectively identify treatment pathways for early breast cancer.
Methods: We applied data mining to the electronic medical records (EMR) of 6891 early breast cancer patients to mine treatment pathways. We provide a method for extracting data from EMR and performed three-stage mining: determining the treatment stages with the cSPADE algorithm → extracting treatment plans with system clustering → mining sequential treatment patterns with cSPADE. The Kolmogorov-Smirnov test and correlation analysis were used to cross-validate the sequence rules of early breast cancer treatment pathways.
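Sequence rules such as those mined in the third stage are commonly scored by support, confidence, and lift. The following is a minimal sketch of those three metrics, not the study's implementation: it assumes each patient is a simple ordered list of single treatment events (cSPADE itself works on itemsets with optional time constraints), and the treatment names and the helper names `contains_in_order`, `support`, and `rule_metrics` are hypothetical.

```python
from typing import Dict, List, Sequence


def contains_in_order(events: Sequence[str], pattern: Sequence[str]) -> bool:
    """Return True if `pattern` occurs in `events` in order (gaps allowed)."""
    remaining = iter(events)
    # `step in remaining` advances the iterator, so order is enforced.
    return all(step in remaining for step in pattern)


def support(db: List[Sequence[str]], pattern: Sequence[str]) -> float:
    """Fraction of patient sequences that contain `pattern`."""
    return sum(contains_in_order(seq, pattern) for seq in db) / len(db)


def rule_metrics(
    db: List[Sequence[str]], lhs: Sequence[str], rhs: Sequence[str]
) -> Dict[str, float]:
    """Support, confidence, and lift of the sequence rule lhs => rhs."""
    sup_rule = support(db, list(lhs) + list(rhs))
    sup_lhs = support(db, lhs)
    sup_rhs = support(db, rhs)
    confidence = sup_rule / sup_lhs if sup_lhs else 0.0
    lift = confidence / sup_rhs if sup_rhs else 0.0
    return {"support": sup_rule, "confidence": confidence, "lift": lift}


# Toy data (invented for illustration, not taken from the study).
patients = [
    ["neoadjuvant_chemo", "surgery", "radiotherapy"],
    ["surgery", "chemo", "radiotherapy"],
    ["surgery", "radiotherapy"],
    ["chemo"],
]

metrics = rule_metrics(patients, ["surgery"], ["radiotherapy"])
print(metrics)
```

In this toy set, three of four patients have surgery followed later by radiotherapy, so the rule has support 0.75 and confidence 1.0; a lift above 1 indicates that radiotherapy follows surgery more often than its overall frequency alone would suggest.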
Results: We identified 55 sequence rules for early breast cancer treatment, 3 preoperative neoadjuvant chemotherapy regimens, 3 postoperative chemotherapy regimens, and 2 chemotherapy regimens for patients without surgery. Pearson and Spearman correlation tests were performed under 5-fold cross-validation. At the p < 0.05 significance level, all correlation coefficients for support, confidence, and lift were greater than 0.89. The Kolmogorov-Smirnov test showed no significant differences between the sequence distributions.
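The kind of agreement check described here can be illustrated in a few lines. This is a sketch under stated assumptions, not the authors' procedure: it compares the support values of the same rules in two hypothetical folds (`fold_a`, `fold_b`, with invented numbers), computes Pearson and Spearman coefficients by hand, and reports only the two-sample Kolmogorov-Smirnov statistic, without a p-value. In practice a statistics library would normally be used for the tests themselves.

```python
import math
from bisect import bisect_right
from typing import List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length samples."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def ks_statistic(x: Sequence[float], y: Sequence[float]) -> float:
    """Two-sample KS statistic: largest gap between the empirical CDFs."""
    xs, ys = sorted(x), sorted(y)
    gap = 0.0
    for v in sorted(set(xs) | set(ys)):
        fx = bisect_right(xs, v) / len(xs)
        fy = bisect_right(ys, v) / len(ys)
        gap = max(gap, abs(fx - fy))
    return gap


# Hypothetical support values for the same five rules in two folds.
fold_a = [0.62, 0.41, 0.33, 0.20, 0.11]
fold_b = [0.60, 0.43, 0.31, 0.22, 0.10]

r_pearson = pearson(fold_a, fold_b)
r_spearman = spearman(fold_a, fold_b)
d_ks = ks_statistic(fold_a, fold_b)
print(r_pearson, r_spearman, d_ks)
```

High correlation coefficients together with a small KS statistic indicate that the rule metrics are stable across folds, which is the sense of robustness the abstract reports.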
Conclusions: We showed that the cSPADE algorithm combined with system clustering is an effective technique for identifying temporal relationships between treatment modalities, enabling hierarchical and vertical mining of breast cancer treatment models. In addition, cross-validation of the treatment pathway sequence rules confirmed the robustness of the results. This method can reveal the treatment pathways of early breast cancer patients and support evaluation of real-world breast cancer treatment behaviour, providing a reference for the redesign and optimization of treatment pathways.
Keywords: breast cancer; clinical pathway; cluster analysis; data mining; sequential pattern mining.
© 2022 John Wiley & Sons Ltd.