Fine-grained visual comparisons with local learning. This dataset comprises 50,025 shoe images. It consists of 4 attributes containing 34 classes each.
An open dataset for attribute classification and street-to-shop image retrieval, comprising 253,983 images and 9 attributes. Each attribute contains 185 classes.