Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep;37(5):2569-2584.
doi: 10.1002/hpm.3483. Epub 2022 Apr 20.

Data-driven treatment pathways mining for early breast cancer using cSPADE algorithm and system clustering

Affiliations

Data-driven treatment pathways mining for early breast cancer using cSPADE algorithm and system clustering

Qing Yang et al. Int J Health Plann Manage. 2022 Sep.

Abstract

Objectives: Due to the multidimensional, multilayered, and chronological order of the cancer data, it was challenging for us to extract treatment paths. To determine whether the cSPADE algorithm and system clustering proposed in this study can effectively identify the treatment pathways for early breast cancer.

Methods: We applied data mining technology to the electronic medical records of 6891 early breast cancer patients to mine treatment pathways. We provided a method of extracting data from EMR and performed three-stage mining: determining the treatment stage through the cSPADE algorithm → system clustering for treatment plan extraction → cSPADE mining sequence pattern for treatment. The Kolmogorov-Smirnov test and correlation analysis were used to cross-validate the sequence rules of early breast cancer treatment pathways.

Results: We unearthed 55 sequence rules for early breast cancer treatment, 3 preoperative neoadjuvant chemotherapy regimens, three postoperative chemotherapy regimens, and 2 chemotherapy regimens for patients without surgery. Through 5-fold cross-validation, Pearson and Spearman correlation tests were performed. At the significance level of p < 0.05, all correlation coefficients of support, confidence and lift were greater than 0.89. Using the Kolmogorov-Smirnov test, we found no significant differences between the sequence distributions.

Conclusions: We have proved that cSPADE algorithm combined system clustering is an effective technique for identifying temporal relationships between treatment modalities, enabling hierarchical and vertical mining of breast cancer treatment models. In addition, we confirmed the robustness of the results by cross-validation of these treatment pathway ordering rules. Through this method, the treatment path of early breast cancer patients can be revealed, and the real-world breast cancer treatment behaviour model can be evaluated, which can provide reference for the redesign and optimization of treatment path.

Keywords: breast cancer; clinical pathway; cluster analysis; data mining; sequential pattern mining.

PubMed Disclaimer

References

REFERENCES

    1. Sung H, Ferlay J, Siegel RL, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71(3):209-249.
    1. Yamauchi C, Sekiguchi K, Nishioka A, et al. The Japanese breast cancer Society clinical practice guideline for radiation treatment of breast cancer, 2015 edition. Breast Cancer. 2016;23(3):378-390.
    1. Lefeuvre D, Le Bihan-Benjamin C, Pauporté Iris, Medioni J, Bousquet P-J. French Medico-Administrative data to identify the care pathways of women with breast cancer. Clini Breast Cancer. 2017;17(4):e191-e197.
    1. Miguel RTD, Silvestre MAA, Imperial MLS, et al. Appraisal of the methodological quality of clinical practice guidelines in the Philippines. Int J Health Plann Manage. 2019;34(4):e1723-e1735.
    1. Dy SM, Garg P, Nyberg D, et al. Critical pathway effectiveness: assessing the Impact of patient, hospital care, and pathway characteristics using Qualitative Comparative analysis. Health Serv Res. 2005;40(2):499-516.

LinkOut - more resources