Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 5;14(1):253.
doi: 10.3390/cancers14010253.

Machine Learning to Discern Interactive Clusters of Risk Factors for Late Recurrence of Metastatic Breast Cancer

Affiliations

Machine Learning to Discern Interactive Clusters of Risk Factors for Late Recurrence of Metastatic Breast Cancer

Juan Luis Gomez Marti et al. Cancers (Basel). .

Abstract

Background: Risk of metastatic recurrence of breast cancer after initial diagnosis and treatment depends on the presence of a number of risk factors. Although most univariate risk factors have been identified using classical methods, machine-learning methods are also being used to tease out non-obvious contributors to a patient's individual risk of developing late distant metastasis. Bayesian-network algorithms can identify not only risk factors but also interactions among these risks, which consequently may increase the risk of developing metastatic breast cancer. We proposed to apply a previously developed machine-learning method to discern risk factors of 5-, 10- and 15-year metastases.

Methods: We applied a previously validated algorithm named the Markov Blanket and Interactive Risk Factor Learner (MBIL) to the electronic health record (EHR)-based Lynn Sage Database (LSDB) from the Lynn Sage Comprehensive Breast Center at Northwestern Memorial Hospital. This algorithm provided an output of both single and interactive risk factors of 5-, 10-, and 15-year metastases from the LSDB. We individually examined and interpreted the clinical relevance of these interactions based on years to metastasis and reliance on interactivity between risk factors.

Results: We found that, with lower alpha values (low interactivity score), the prevalence of variables with an independent influence on long-term metastasis was higher (i.e., HER2, TNEG). As the value of alpha increased to 480, stronger interactions were needed to define clusters of factors that increased the risk of metastasis (i.e., ER, smoking, race, alcohol usage).

Conclusion: MBIL identified single and interacting risk factors of metastatic breast cancer, many of which were supported by clinical evidence. These results strongly recommend the development of further large data studies with different databases to validate the degree to which some of these variables impact metastatic breast cancer in the long term.

Keywords: Markov Blanket and Interactive Risk Factor Learner (MBIL); causal learning; machine learning; metastasis; metastatic breast cancer; risk factors.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
A BN DAG model illustrating the Markov blanket. The Markov blanket of T consists of nodes X11, X12, X13, X14 and X15. These nodes are the direct risk factors of T and separate T from the influence of the noisy variables X1–X10, X16 and X17 (adapted from [8]).
Figure 2
Figure 2
MBIL-generated causal sets of 5-, 10- and 15-year breast cancer metastases.
Figure 3
Figure 3
The MBIL generated an output of clinical interactions related to 5-, 10- and 15-year breast cancer metastases. HER2, ER, grade, race/ethnicity, TNEG, smoking/alcohol and surgical margins are represented. Each bar plot indicates the number of counts in which each of these variables was identified as a risk factor of metastasis at different values of alpha.

Similar articles

Cited by

References

    1. Sopik V., Sun P., Narod S.A. Predictors of time to death after distant recurrence in breast cancer patients. Breast Cancer Res. Treat. 2018;173:465–474. doi: 10.1007/s10549-018-5002-9. - DOI - PubMed
    1. Sestak I., Cuzick J. Markers for the identification of late breast cancer recurrence. Breast Cancer Res. 2015;17:10. doi: 10.1186/s13058-015-0516-0. - DOI - PMC - PubMed
    1. Davies C., Pan H., Godwin J., Gray R., Arriagada R., Raina V., Abraham M., Medeiros Alencar V.H., Badran A., Bonfill X., et al. Long-term effects of continuing adjuvant tamoxifen to 10 years versus stopping at 5 years after diagnosis of oestrogen receptor-positive breast cancer: ATLAS, a randomised trial. Lancet. 2013;381:805–816. doi: 10.1016/S0140-6736(12)61963-1. - DOI - PMC - PubMed
    1. Pan H., Gray R., Braybrooke J., Davies C., Taylor C., McGale P., Peto R., Pritchard K.I., Bergh J., Dowsett M., et al. 20-Year Risks of Breast-Cancer Recurrence after Stopping Endocrine Therapy at 5 Years. N. Engl. J. Med. 2017;377:1836–1846. doi: 10.1056/NEJMoa1701830. - DOI - PMC - PubMed
    1. Bhutiani N., Egger M.E., Ajkay N., Scoggins C.R., Martin R.C., 2nd, McMasters K.M. Multigene Signature Panels and Breast Cancer Therapy: Patterns of Use and Impact on Clinical Decision Making. J. Am. Coll. Surg. 2018;226:406–412.e1. doi: 10.1016/j.jamcollsurg.2017.12.043. - DOI - PubMed