
The SYSU-30k dataset is a large-scale, weakly supervised person re-identification (ReID) dataset with over 29 million images gathered from TV program videos. The videos are randomly split into clips, and each clip is manually annotated with a single identity. Every person detected in a clip is then assigned that identity, so the labels of individual images are noisy and the annotation holds only at the bag (clip) level.
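
The bag-level labeling described above can be illustrated with a minimal Python sketch. The `Bag` class, the `propagate_label` helper, and the file and identity names are all hypothetical and chosen for illustration only; they do not reflect the dataset's actual file layout or any official loader.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Bag:
    """One video clip treated as a bag (hypothetical structure)."""
    clip_id: str
    label: str          # the single identity annotated for the whole clip
    crops: List[str]    # every person detection found in the clip


def propagate_label(bag: Bag) -> List[Tuple[str, str]]:
    """Assign the clip's identity to every detection in the bag.

    The resulting per-image labels are noisy: detections of other
    people in the same clip receive the same identity.
    """
    return [(crop, bag.label) for crop in bag.crops]


# Illustrative bag: three detections share one clip-level identity.
bag = Bag(
    clip_id="clip_0001",
    label="person_A",
    crops=["det_0.jpg", "det_1.jpg", "det_2.jpg"],
)
pairs = propagate_label(bag)
```

In this sketch all three detections inherit `person_A`, even though only some of them may actually depict that person, which is why the supervision is considered weak.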
