Comparative Analysis & Quiz • Crosswalk Tutorial

Comparative Analysis

Voice-over: ../assets/audio/analysis.mp3

This page summarizes what prior research reports about when different methods work well, where they fail, and what trade-offs to expect. These are literature-based observations, not new experiments.

Classical CV (Edges/Lines/Periodicity)

Strengths: fast, simple, interpretable; runs on limited hardware.
Works best: clean paint, good contrast, daylight.
Weaknesses: fails under wear, shadows, occlusion, or unusual designs [1].

Deep Learning (Detectors/Segmentation)

Strengths: better tolerance to viewpoint changes and partial occlusion.
Works best: when trained with diverse data; clear daytime scenes.
Weaknesses: domain shift (new cities/night), compute for real-time; needs careful evaluation of FPS/latency [2] [7].

VLM / Multimodal (Context & Reasoning)

Strengths: adds scene context and explanations (e.g., is it safe to cross?).
Works best: as a complement to fast detectors—reasoning about tricky cases.
Weaknesses: latency/compute, reliability of explanations, sensitivity to prompts [3] [5].

Key Takeaways

No single method wins everywhere. Clean scenes favor classical methods; complex scenes need CNNs; edge cases may benefit from VLM reasoning.

Data matters. Diversity across cities, weather, and lighting improves robustness (but domain shift remains a challenge) [7].

Deployment = accuracy + speed. Report mAP/mIoU and FPS/latency; embedded use often requires compression/distillation [7].

Quick Quiz (5 Questions)

References on this page: [1] [2] [3] [5] [7]