The image object detection track of VisDrone2018 provides a dataset of 10,209 images annotated with 10 categories covering pedestrians, vehicles, and other traffic objects.
CrowdHuman is a challenging benchmark for evaluating how well detectors handle crowded scenes; it contains about 15k training images and 4k images for evaluation.