LLM Multimodal Traffic Accident Forecasting

I de Zarzà^{1

2

3}, J de Curtò^{1

2

3}, Gemma Roig^{1

4}, Carlos T Calafate²

Affiliations

¹ Informatik und Mathematik, GOETHE-University Frankfurt am Main, 60323 Frankfurt am Main, Germany.
² Departamento de Informática de Sistemas y Computadores, Universitat Politècnica de València, 46022 València, Spain.
³ Estudis d'Informàtica, Multimèdia i Telecomunicació, Universitat Oberta de Catalunya, 08018 Barcelona, Spain.
⁴ HESSIAN Center for AI (hessian.AI), 64289 Darmstadt, Germany.

PMID: 38005612
PMCID: PMC10674612
DOI: 10.3390/s23229225

LLM Multimodal Traffic Accident Forecasting

I de Zarzà et al. Sensors (Basel). 2023.

. 2023 Nov 16;23(22):9225.

doi: 10.3390/s23229225.

Authors

I de Zarzà^{1

2

3}, J de Curtò^{1

2

3}, Gemma Roig^{1

4}, Carlos T Calafate²

Affiliations

¹ Informatik und Mathematik, GOETHE-University Frankfurt am Main, 60323 Frankfurt am Main, Germany.
² Departamento de Informática de Sistemas y Computadores, Universitat Politècnica de València, 46022 València, Spain.
³ Estudis d'Informàtica, Multimèdia i Telecomunicació, Universitat Oberta de Catalunya, 08018 Barcelona, Spain.
⁴ HESSIAN Center for AI (hessian.AI), 64289 Darmstadt, Germany.

PMID: 38005612
PMCID: PMC10674612
DOI: 10.3390/s23229225

Abstract

With the rise in traffic congestion in urban centers, predicting accidents has become paramount for city planning and public safety. This work comprehensively studied the efficacy of modern deep learning (DL) methods in forecasting traffic accidents and enhancing Level-4 and Level-5 (L-4 and L-5) driving assistants with actionable visual and language cues. Using a rich dataset detailing accident occurrences, we juxtaposed the Transformer model against traditional time series models like ARIMA and the more recent Prophet model. Additionally, through detailed analysis, we delved deep into feature importance using principal component analysis (PCA) loadings, uncovering key factors contributing to accidents. We introduce the idea of using real-time interventions with large language models (LLMs) in autonomous driving with the use of lightweight compact LLMs like LLaMA-2 and Zephyr-7b-α. Our exploration extends to the realm of multimodality, through the use of Large Language-and-Vision Assistant (LLaVA)-a bridge between visual and linguistic cues by means of a Visual Language Model (VLM)-in conjunction with deep probabilistic reasoning, enhancing the real-time responsiveness of autonomous driving systems. In this study, we elucidate the advantages of employing large multimodal models within DL and deep probabilistic programming for enhancing the performance and usability of time series forecasting and feature weight importance, particularly in a self-driving scenario. This work paves the way for safer, smarter cities, underpinned by data-driven decision making.

Keywords: LLM; LLaVA; PCA loadings; VLM; accident forecasting; time series analysis; transformers.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Figures

**Figure 1**
Correlation matrix visualization: This plot provides a visual representation of the pairwise correlation between the selected numeric attributes from the accident dataset. The color intensity and the size of the circles correspond to the correlation coefficients. Positive correlations are displayed in blue and negative correlations in red. This visualization aids in identifying attributes that might have a strong association with each other, providing insights into potential multicollinearity and relationships within the data.

**Figure 2**
Accidents distribution over the week by time of day.

**Figure 3**
Model forecast with Prophet, with daily and yearly seasonality. The x-axis represents the date. The y-axis denotes the number of accidents. The dark dots represent the actual historical data of accidents, while the light blue line illustrates the forecasted values. The shaded blue region represents the uncertainty intervals of the forecast.

**Figure 4**
Model training and validation loss for the Transformer architecture.

**Figure 5**
Comparison of model predictions against actual values.

**Figure 6**
Top feature weights across different components of the PCA loadings.

**Figure 7**
Diagrammatic representation of the image hashing and retrieval mechanism infused with DL and enriched by LLaVA.

**Figure 8**
Illustrative representation of a sample database $D$ consisting of 12 distinct traffic images representing edge cases selected by the self-driving operators.

**Figure 9**
Histogram illustrating the variance of each feature derived from the Bayesian ResNet. Each bar represents the variance of a specific feature, highlighting the spread and uncertainty in the feature representations. The horizontal axis enumerates the features, while the vertical axis quantifies the variance, emphasizing regions of high and low confidence in the extracted features.

See this image and copyright information in PMC

References

1. Guo Y., Tang Z., Guo J. Could a Smart City Ameliorate Urban Traffic Congestion? A Quasi-Natural Experiment Based on a Smart City Pilot Program in China. Sustainability. 2020;12:2291. doi: 10.3390/su12062291. - DOI
1. Zonouzi M.N., Kargari M. Modeling uncertainties based on data mining approach in emergency service resource allocation. Comput. Ind. Eng. 2020;145:106485. doi: 10.1016/j.cie.2020.106485. - DOI
1. Vlahogianni E.I., Karlaftis M.G., Golias J.C. Short-term traffic forecasting: Where we are and where we’re going. Transp. Res. Part Emerg. Technol. 2014;43:3–19. doi: 10.1016/j.trc.2014.01.005. - DOI
1. Weng W., Fan J., Wu H., Hu Y., Tian H., Zhu F., Wu J. A Decomposition Dynamic graph convolutional recurrent network for traffic forecasting. Pattern Recognit. 2023;142:109670. doi: 10.1016/j.patcog.2023.109670. - DOI
1. Negash N.M., Yang J. Driver Behavior Modeling Towards Autonomous Vehicles: Comprehensive Review. IEEE Access. 2023;11:22788–22821. doi: 10.1109/ACCESS.2023.3249144. - DOI

Grants and funding

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

LLM Multimodal Traffic Accident Forecasting

Affiliations

LLM Multimodal Traffic Accident Forecasting

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources