Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 2;22(Suppl 6):346.
doi: 10.1186/s12911-023-02113-7.

Leveraging process mining for modeling progression trajectories in amyotrophic lateral sclerosis

Affiliations

Leveraging process mining for modeling progression trajectories in amyotrophic lateral sclerosis

Erica Tavazzi et al. BMC Med Inform Decis Mak. .

Abstract

Background: Amyotrophic Lateral Sclerosis (ALS) is a neurodegenerative disease whose spreading and progression mechanisms are still unclear. The ability to predict ALS prognosis would improve the patients' quality of life and support clinicians in planning treatments. In this paper, we investigate ALS evolution trajectories using Process Mining (PM) techniques enriched to both easily mine processes and automatically reveal how the pathways differentiate according to patients' characteristics.

Methods: We consider data collected in two distinct data sources, namely the Pooled Resource Open-Access ALS Clinical Trials (PRO-ACT) dataset and a real-world clinical register (ALS-BS) including data of patients followed up in two tertiary clinical centers of Brescia (Italy). With a focus on the functional abilities progressively impaired as the disease progresses, we use two Process Discovery methods, namely the Directly-Follows Graph and the CareFlow Miner, to mine the population disease trajectories on the PRO-ACT dataset. We characterize the impairment trajectories in terms of patterns, timing, and probabilities, and investigate the effect of some patients' characteristics at onset on the followed paths. Finally, we perform a comparative study of the impairment trajectories mined in PRO-ACT versus ALS-BS.

Results: We delineate the progression pathways on PRO-ACT, identifying the predominant disabilities at different stages of the disease: for instance, 85% of patients enter the trials without disabilities, and 48% of them experience the impairment of Walking/Self-care abilities first. We then test how a spinal onset increases the risk of experiencing the loss of Walking/Self-care ability as first impairment (52% vs. 27% of patients develop it as the first impairment in the spinal vs. the bulbar cohorts, respectively), as well as how an older age at onset corresponds to a more rapid progression to death. When compared, the PRO-ACT and the ALS-BS patient populations present some similarities in terms of natural progression of the disease, as well as some differences in terms of observed trajectories plausibly due to the trial scheduling and recruitment criteria.

Conclusions: We exploited PM to provide an overview of the evolution scenarios of an ALS trial population and to preliminary compare it to the progression observed in a clinical cohort. Future work will focus on further improving the understanding of the disease progression mechanisms, by including additional real-world subjects as well as by extending the set of events considered in the impairment trajectories.

Keywords: Amyotrophic lateral sclerosis; Patient stratification; Process discovery; Process mining; Prognosis; Progression trajectories.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
DFG graph representing the paths followed by the study population, delineating the increase in disability experienced by the subjects. Only the arcs with a transition probability >0.03 are shown
Fig. 2
Fig. 2
a Zoom on the DFG deltaGraph obtained stratifying the population by onset site (spinal vs. bulbar). The highlighted edges represent an increased transition probability for the spinal (red) or bulbar (green) cohort, thresholded for displaying only differences between the probabilities greater than 0.2. b KM curves of the time passing from M_0000 (no impaired domains) to M_1000 (Walking/Self-care domain impaired) for the two cohorts. The log-rank test shows statistically significant differences between the cohorts. + indicates censored subjects
Fig. 3
Fig. 3
CFM graph built starting from M_0000. Each node reports the total number of patients passing through it (round brackets) and the min-median-max time, in days, needed to reach it from the root. Colors are graded on the median times, with intervals: <100, 101-200, 201-400, and >401 days. The edges report the percentage of patients passing through the child node with respect to the previous node (above) and the entire population (below). The graph has been thresholded for displaying only the pathways transitioned by at least 10 subjects
Fig. 4
Fig. 4
CFM graphs built starting from M_0000 and stratified for a quantized age at onset or b onset site. Each node reports the number of patients passing trough it for each cohort (young/aged onset and spinal/bulbar onset, respectively). Moreover, the ratio of the two cardinalities with respect to the initial populations is reported in the round brackets, followed by the p-value for the Fisher’s exact/χ2 test. The node box is colored in yellow if the p-value is lower than a given threshold (here 0.05). Both the graphs have been thresholded for displaying only the pathways transitioned by at least 10 subjects
Fig. 5
Fig. 5
On the top, a portion of the DFG showing the differences between PRO-ACT and ALS–BS in terms of probability to move from a state to the next one is shown. The gray edges represents the transitions whose probabilities differ for less than 0.2 in the two datasets. When this gap is larger, the edge is green if ALS–BS has a probability higher than PRO-ACT, red otherwise. On the bottom, the kernel density estimation of the probability to move (a) from M_1000 to M_0101; (b) from M_1000 to M_1110, and (c) from M_1001 to M_1111 are reported, with PRO-ACT in blue and ALS–BS in red, and with the vertical dotted line indicating the median of the distributions

References

    1. Beghi E, Chiò A, Couratier P, Esteban J, Hardiman O, Logroscino G, Millul A, Mitchell D, Preux P-M, Pupillo E. The epidemiology and treatment of ALS: focus on the heterogeneity of the disease and critical appraisal of therapeutic trials. Amyotroph Lateral Scler. 2011;12(1):1–10. doi: 10.3109/17482968.2010.502940. - DOI - PMC - PubMed
    1. Goyal NA, Berry JD, Windebank A, Staff NP, Maragakis NJ, van den Berg LH, Genge A, Miller R, Baloh RH, Kern R, Gothelf Y, Lebovits C, Cudkowicz M. Addressing heterogeneity in amyotrophic lateral sclerosis CLINICAL TRIALS. Muscle Nerve. 2020;62(2):156–166. doi: 10.1002/mus.26801. - DOI - PMC - PubMed
    1. Küffner R, Zach N, Norel R, Hawe J, Schoenfeld D, Wang L, Li G, Fang L, Mackey L, Hardiman O. Crowdsourced analysis of clinical trial data to predict amyotrophic lateral sclerosis progression. Nat Biotechnol. 2015;33(1):51. doi: 10.1038/nbt.3051. - DOI - PubMed
    1. Kueffner R, Zach N, Bronfeld M, Norel R, Atassi N, Balagurusamy V, Di Camillo B, Chio A, Cudkowicz M, Dillenberger D. Stratification of amyotrophic lateral sclerosis patients: a crowdsourcing approach. Sci Rep. 2019;9(1):690. doi: 10.1038/s41598-018-36873-4. - DOI - PMC - PubMed
    1. Grollemund V, Le Chat G, Secchi-Buhour M-S, Delbot F, Pradat-Peyre J-F, Bede P, Pradat P-F. Manifold learning for amyotrophic lateral sclerosis functional loss assessment. J Neurol. 2021;268(3):825–850. doi: 10.1007/s00415-020-10181-2. - DOI - PubMed

Publication types