The Caltech dataset is a benchmark for pedestrian detection, consisting of approximately 10 hours of video taken from a vehicle driving through Los Angeles.
The KITTI 2015 dataset is a real-world dataset of street views, containing 200 training stereo image pairs with sparsely labeled disparity from LiDAR data.