Datasets & Evaluation Metrics

Voice-over: ../assets/audio/datasets.mp3


Crosswalk research often combines general-purpose driving datasets with smaller, task-specific sets. Large public datasets help with pretraining and benchmarking (e.g., city scenes, semantic labels), while many papers also curate their own crosswalk crops or masks for fine-tuning and evaluation [1] [7].

Commonly Used Datasets

Dataset Type Why it’s useful
Cityscapes Urban street scenes with pixel-level labels High-quality segmentation ground truth for roads, sidewalks, people, vehicles, etc.; ideal for training/validating segmentation backbones.
KITTI Driving scenes (detection, stereo, tracking) Standard benchmarks for detector backbones and evaluation protocols.
Task-Specific Crosswalk Sets Custom crops / masks from dashcams or maps Many papers build small labeled sets of crosswalk patches or masks to fine-tune detectors/segmenters on the target task [1] [7].

Evaluation Metrics

Cityscapes dataset segmentation example 1
Cityscapes example with semantic labels (roads, sidewalks, people, vehicles). Source: Cityscapes dataset.
Cityscapes dataset segmentation example 2
Another annotated Cityscapes scene, commonly used to pretrain/evaluate segmentation backbones. Source: Cityscapes dataset.
Cityscapes dataset segmentation example 3
Intersection view with multiple labeled classes. Source: Cityscapes dataset.
Cityscapes Zurich example with pixel-level segmentation
Cityscapes example from Zurich, annotated with semantic segmentation. Source: Cityscapes dataset.
References on this page: [1] [7]