Class-imbalanced crash prediction based on real-time traffic and weather data: A driving simulator study
- PMID: 32125890
- DOI: 10.1080/15389588.2020.1723794
Class-imbalanced crash prediction based on real-time traffic and weather data: A driving simulator study
Abstract
Objective: Crash occurrence prediction has been of major importance in proactively improving traffic safety and reducing potential inconveniences to road users. Conventional statistical crash prediction models frequently suffer from severe data quality issues and require a significant amount of historical data. On the other hand, even though machine learning (ML) based algorithms have proven to be powerful in predicting future outcomes in different fields of applications, they likely fail to provide satisfactory results unless a tuning parameter approach is conducted. The main objective of this article is to develop real-time crash prediction models that will potentially be employed within traffic management systems.Methods: In this study, two highly optimized data-driven models for crash occurrence prediction have been designed based on the popular machine learning techniques, Support Vector Machine (SVM) and deep neural network Multilayer Perceptron (MLP). To ensure that the proposed algorithms produce robust and stable performance, the optimal scheme for models' construction has been thoroughly examined and discussed. Additionally, the further boost of models' performance requires the systemic assessment of crash strongest precursors within the driver-vehicle-environment triptych. Therefore, three categories of features, including driver input responses, vehicle kinematics and weather conditions, were measured during the execution of various driving tasks performed on a desktop driving simulator. Moreover, since crash events typically occur in rare instances tending to be underrepresented in the dataset, an imbalance-aware strategy to overcome the issue was adopted using the Synthetic Minority Oversampling TEchnique (SMOTE).Results: The results show that MLP exhibited the best performing prediction results, most particularly, in clear, overcast and snow conditions, in which MLP recall values were above 94%. Higher F1-score values were achieved in overcast and rain weather by MLP and snow conditions by SVM; whereas over 90% of G-mean levels were obtained under fog and rain conditions for MLP and snow condition for SVM.Conclusion: The findings provide new insights into crash events forecasting and may be used to promote enforcement efforts related to designing crash avoidance/warning systems that enhance the effectiveness of the system's application based on driver input and vehicle kinematics under various weather conditions.
Keywords: Crash prediction; SMOTE; driving simulator; machine learning; multilayer perceptron; support vector machine.
Similar articles
-
Severity analysis of road transport accidents of hazardous materials with machine learning.Traffic Inj Prev. 2021;22(4):324-329. doi: 10.1080/15389588.2021.1900569. Epub 2021 Apr 13. Traffic Inj Prev. 2021. PMID: 33849325
-
Detecting lane change maneuvers using SHRP2 naturalistic driving data: A comparative study machine learning techniques.Accid Anal Prev. 2020 Jul;142:105578. doi: 10.1016/j.aap.2020.105578. Epub 2020 May 11. Accid Anal Prev. 2020. PMID: 32408143
-
Efficient mapping of crash risk at intersections with connected vehicle data and deep learning models.Accid Anal Prev. 2020 Sep;144:105665. doi: 10.1016/j.aap.2020.105665. Epub 2020 Jul 16. Accid Anal Prev. 2020. PMID: 32683130
-
Multivariate copula temporal modeling of intersection crash consequence metrics: A joint estimation of injury severity, crash type, vehicle damage and driver error.Accid Anal Prev. 2019 Apr;125:188-197. doi: 10.1016/j.aap.2019.01.036. Epub 2019 Feb 13. Accid Anal Prev. 2019. PMID: 30771588 Review.
-
Advances, challenges, and future research needs in machine learning-based crash prediction models: A systematic review.Accid Anal Prev. 2024 Jan;194:107378. doi: 10.1016/j.aap.2023.107378. Epub 2023 Nov 15. Accid Anal Prev. 2024. PMID: 37976634
Cited by
-
Exploring the effects of stationary camera spots on inferences drawn from real-time crash severity models.Sci Rep. 2022 Nov 25;12(1):20321. doi: 10.1038/s41598-022-24102-y. Sci Rep. 2022. PMID: 36434001 Free PMC article.
-
An explainable multi-task deep learning framework for crash severity prediction using multi-source data.Sci Rep. 2025 Jul 1;15(1):21978. doi: 10.1038/s41598-025-09226-1. Sci Rep. 2025. PMID: 40596431 Free PMC article.
-
Leveraging Wearable Sensors in Virtual Reality Driving Simulators: A Review of Techniques and Applications.Sensors (Basel). 2024 Jul 8;24(13):4417. doi: 10.3390/s24134417. Sensors (Basel). 2024. PMID: 39001197 Free PMC article.
-
Crash severity analysis: A data-enhanced double layer stacking model using semantic understanding.Heliyon. 2024 Apr 29;10(9):e30117. doi: 10.1016/j.heliyon.2024.e30117. eCollection 2024 May 15. Heliyon. 2024. PMID: 38765089 Free PMC article.
-
Science fiction or clinical reality: a review of the applications of artificial intelligence along the continuum of trauma care.World J Emerg Surg. 2023 Mar 6;18(1):16. doi: 10.1186/s13017-022-00469-1. World J Emerg Surg. 2023. PMID: 36879293 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources