V3Det: Vast Vocabulary Visual Detection Dataset

V3Det is a vast vocabulary visual detection dataset, containing extremely large categories, which consist of 13,029 categories on real-world images.

BibTex: