Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2024 May 6;19(5):e0302171.
doi: 10.1371/journal.pone.0302171. eCollection 2024.

Comparing fatal crash risk factors by age and crash type by using machine learning techniques

Affiliations
Comparative Study

Comparing fatal crash risk factors by age and crash type by using machine learning techniques

Abdulaziz H Alshehri et al. PLoS One. .

Abstract

This study aims to use machine learning methods to examine the causative factors of significant crashes, focusing on accident type and driver's age. In this study, a wide-ranging data set from Jeddah city is employed to look into various factors, such as whether the driver was male or female, where the vehicle was situated, the prevailing weather conditions, and the efficiency of four machine learning algorithms, specifically XGBoost, Catboost, LightGBM and RandomForest. The results show that the XGBoost Model (accuracy of 95.4%), the CatBoost model (94% accuracy), and the LightGBM model (94.9% accuracy) were superior to the random forest model with 89.1% accuracy. It is worth noting that the LightGBM had the highest accuracy of all models. This shows various subtle changes in models, illustrating the need for more analyses while assessing vehicle accidents. Machine learning is also a transforming tool in traffic safety analysis while providing vital guidelines for developing accurate traffic safety regulations.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Distribution of driver age peaks in the 20–50-year range.
Fig 2
Fig 2. Accidents by crash type.
Fig 3
Fig 3. Accidents in different weather conditions.
Fig 4
Fig 4. Correlation heatmap between variable.
Fig 5
Fig 5. Methodology flow chart.
Fig 6
Fig 6. Confusion matrix for LightGBM model.
Fig 7
Fig 7. Confusion matrix for the XGBoost model.
Fig 8
Fig 8. Confusion matrix for CatBoost model.
Fig 9
Fig 9. Confusion matrix of random forest model.
Fig 10
Fig 10. Summary plot for the binary model.
Fig 11
Fig 11. Feature importance summary plot.
Fig 12
Fig 12. Feature importance bar chart.
Fig 13
Fig 13. SHAP value for fatal injuries.
Fig 14
Fig 14. SHAP value for non-fatal injuries.

Similar articles

References

    1. Alkahtani KF, Abdel-Aty M, Lee J (2019) A zonal level safety investigation of pedestrian crashes in Riyadh, Saudi Arabia. Int J Sustain Transp 13:255–267. 10.1080/15568318.2018.1463417 - DOI
    1. Lee J, Liu H, Abdel-Aty M (2023) Changes in traffic crash patterns: Before and after the outbreak of COVID-19 in Florida. Accid Anal Prev 190:107187. doi: 10.1016/j.aap.2023.107187 - DOI - PMC - PubMed
    1. Mohammed S, Alkhereibi AH, Abulibdeh A, et al. (2023) GIS-based spatiotemporal analysis for road traffic crashes; in support of sustainable transportation Planning. Transp Res Interdiscip Perspect 20:100836. 10.1016/j.trip.2023.100836 - DOI
    1. Shaaban K, Siam A, Badran A (2021) Analysis of Traffic Crashes and Violations in a Developing Country. Transp Res Procedia 55:1689–1695. 10.1016/j.trpro.2021.07.160 - DOI
    1. Jakobsen MD, Glies Vincents Seeberg K, Møller M, et al. (2023) Influence of occupational risk factors for road traffic crashes among professional drivers: systematic review. Transp Rev 43:533–563. 10.1080/01441647.2022.2132314 - DOI

Publication types