Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends

dc.contributor.authorTami, Mohammad
dc.contributor.authorAbu Elhenawy, Mohammed
dc.contributor.authorAshqar, Huthaifa
dc.date.accessioned2025-10-16T15:27:10Z
dc.date.issued2025-04-21
dc.description.abstractTraffic safety remains a critical global challenge, with traditional Advanced Driver-Assistance Systems (ADAS) often struggling in dynamic real-world scenarios due to fragmented sensor processing and susceptibility to adversarial conditions. This paper reviews the transformative potential of Multimodal Large Language Models (MLLMs) in addressing these limitations by integrating cross-modal data such as visual, spatial, and environmental inputs to enable holistic scene understanding. Through a comprehensive analysis of MLLM-based approaches, we highlight their capabilities in enhancing perception, decision-making, and adversarial robustness, while also examining the role of key datasets (e.g., KITTI, DRAMA, ML4RoadSafety) in advancing research. Furthermore, we outline future directions, including real-time edge deployment, causality-driven reasoning, and human-AI collaboration. By positioning MLLMs as a cornerstone for next-generation traffic safety systems, this review underscores their potential to revolutionize the field, offering scalable, context-aware solutions that proactively mitigate risks and improve overall road safety.
dc.description.urihttp://arxiv.org/abs/2504.16134
dc.format.extent15 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m202q9-ay7t
dc.identifier.urihttps://doi.org/10.48550/arXiv.2504.16134
dc.identifier.urihttp://hdl.handle.net/11603/40446
dc.language.isoen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Data Science
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectComputer Science - Computer Vision and Pattern Recognition
dc.subjectComputer Science - Computation and Language
dc.titleMultimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-6835-8338

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2504.16134v1.pdf
Size:
691.33 KB
Format:
Adobe Portable Document Format