Annotated Bibliography
Voice-over: ../assets/audio/bibliography.mp3
References are listed in ACM citation style, in order of first appearance. Each includes a synopsis and reliability rating.
-
K. Romić, I. Galić, H. Leventić, and M. Habijan. 2021.
Pedestrian Crosswalk Detection Using a Column and Row Structure Analysis in Assistance Systems for the Visually Impaired.
Acta Polytechnica Hungarica 18, 7.
DOI: PDF
Synopsis: Rule-based detection using row/column intensity and periodicity; evaluates in assistive settings.
Reliability: Peer-reviewed journal (High). -
M. Haider et al. 2025.
Advanced Zebra Crosswalk Detection Using Deep Learning Techniques.
International Journal of Engineering and Science Education 13, 3.
DOI: PDF
Synopsis: Deep learning pipeline (detection + segmentation) tested under occlusion and varied lighting.
Reliability: Academic venue (Medium; confirm peer-review). -
H. Hwang, S. Kwon, Y. Kim, and D. Kim. 2024.
Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing.
arXiv preprint arXiv:2402.06794.
DOI: arXiv
Synopsis: Applies GPT-4V for assessing crossing safety and generating explanations.
Reliability: Preprint (Medium). -
M. Liu, J. Jiang, C. Zhu, and X.-C. Yin. 2023.
VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023).
DOI: PDF
Synopsis: Vision-language supervision improves pedestrian detection under occlusion and crowding.
Reliability: Top-tier conference (High). -
C. Zhang et al. 2023/2024.
Vision-Language Models in Autonomous Driving: A Survey and Outlook.
arXiv preprint arXiv:2310.14414.
DOI: arXiv
Synopsis: Comprehensive survey of VLMs in autonomous driving, covering perception, planning, and open challenges.
Reliability: Survey preprint (Medium). -
J. Fan, J. Wu, J. Gao, J. Yu, Y. Wang, H. Chu, and B. Gao. 2024.
MLLM-SUL: Multimodal LLM for Scene Understanding and Risk Localization in Traffic.
arXiv preprint arXiv:2412.19406.
DOI: arXiv
Synopsis: Demonstrates a multimodal LLM that narrates traffic scenes and highlights risk zones.
Reliability: Preprint (Medium). -
Z.-D. Zhang, M.-L. Tan, Z.-C. Lan, H.-C. Liu, L. Pei, and W.-X. Yu. 2022.
CDNet: A Real-Time and Robust Crosswalk Detection Network on Jetson Nano Based on YOLOv5.
Neural Computing and Applications.
DOI: 10.1007/s00521-022-07007-9
Synopsis: YOLOv5-based crosswalk detector optimized for Jetson; reports strong FPS and accuracy.
Reliability: Peer-reviewed journal (High).
Tip: use inline citations like
<a class="cite" href="bibliography.html#ref-3">[3]</a>
in your tutorial pages so readers can jump here.