Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 1;15(1):21978.
doi: 10.1038/s41598-025-09226-1.

An explainable multi-task deep learning framework for crash severity prediction using multi-source data

Affiliations

An explainable multi-task deep learning framework for crash severity prediction using multi-source data

Yuanyuan Xiao et al. Sci Rep. .

Abstract

Traffic accidents pose significant global challenges, causing substantial injuries, fatalities, and economic losses. Current research predominantly focuses on single-prediction objectives (e.g., fatality prediction) while neglecting property damage assessments and critical interactions between prediction tasks. Although neural networks demonstrate superior predictive capabilities, their application in traffic safety analysis remains constrained by inherent limitations in causal interpretability, coupled with challenges posed by data imbalance, heterogeneity, and complexity in crash datasets. This study proposes an interpretable multi-task learning framework (Adv MT-DNN) that synergistically integrates an enhanced deep neural network with post-hoc explanation methods for comprehensive crash severity prediction. Our dual-focused approach addresses multiple prediction targets (including fatalities, severe injuries, and property damage). It provides granular insights into contributing factors through SHAP-based feature importance rankings and interaction analysis. Validated using four-year (2018-2021) multi-source traffic data from China, the framework demonstrates significant improvements in prediction accuracy compared to baselines. Nonparametric estimation of the top-8 critical factors (e.g., blood alcohol content, collision type, and accident occurrence period) confirms statistically significant associations with crash severity. The explicit interpretation mechanism bridges the critical gap between predictive performance and model interpretability in traffic safety analytics, providing engineering-relevant insights. This research establishes a robust methodological foundation for developing data-driven road safety policies and intelligent transportation systems, particularly in developing countries with complex traffic ecosystems.

Keywords: Crash severity prediction; Explainable AI (XAI); Multi-source traffic data; Multi-task learning.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The research framework.
Fig. 2
Fig. 2
Model results between balanced and RTA datasets.
Fig. 3
Fig. 3
(a) ROC curves per class and mean and (b) epochs process.
Fig. 4
Fig. 4
Local explanation comparison with SHAP and LIME.
Fig. 5
Fig. 5
SHAP values.
Fig. 6
Fig. 6
SHAP values for driver experience (De).
Fig. 7
Fig. 7
SHAP values for age (Ag).
Fig. 8
Fig. 8
SHAP values for gender (Gd).
Fig. 9
Fig. 9
SHAP values for drunk driving (Bac).
Fig. 10
Fig. 10
SHAP values for safety device usage (Sdu).
Fig. 11
Fig. 11
SHAP values for road type (Rt).
Fig. 12
Fig. 12
SHAP values for accident occurring period (Aop).
Fig. 13
Fig. 13
Interaction effects between workday (Wd) and road type (Rt).
Fig. 14
Fig. 14
Interaction effects with visibility (Vis) and drunk driving (Bac).
Fig. 15
Fig. 15
Interaction effects with alcohol content (Bac) and age (Ag).
Fig. 16
Fig. 16
Interaction effects between gender (Gd) and driver attributes.
Fig. 17
Fig. 17
Interaction effects between motor vehicle (Mv) and vehicle safety status (Vss).
Fig. 18
Fig. 18
Interaction effects between safety device (Sdu) and vehicle safety status (Vss).

Similar articles

References

    1. World Health Organization. Global Status Report on Road Safety 2015 (World Health Organization, 2015).
    1. National Bureau of Statistics of China. China Statistical Yearbook 2020. (2020).
    1. Tambouratzis, T., Souliou, D., Chalikias, M. & Gregoriades, A. Maximising accuracy and efficiency of traffic accident prediction combining information mining with computational intelligence approaches and decision trees. J. Artif. Intell. Soft Comput. Res.4 (1), 31–42. 10.2478/jaiscr-2014-0023 (Jan. 2014).
    1. Zhu, L., Lu, L., Zhang, W., Zhao, Y. & Song, M. Analysis of accident severity for curved roadways based on bayesian networks. Sustainability11 (8). 10.3390/su11082223 (2019).
    1. Arteaga, C., Paz, A. & Park, J. Injury severity on traffic crashes: A text mining with an interpretable machine-learning approach. Saf. Sci.132, 104988. 10.1016/j.ssci.2020.104988 (Dec. 2020).

LinkOut - more resources