ATR dataset

Human parsing has recently attracted a huge amount of interests and achieved great progress with the advance of deep convolutional neural networks and large-scale datasets. Most of the prior works focus on developing new structures and auxiliary information guidance to improve general feature representation, such as dilated convolution, LSTM structure, encoder-decoder architecture, and human pose constraints. Although these methods show promising results on each human parsing dataset, they directly use one flat prediction layer to classify all labels, which disregards the intrinsic semantic correlations across concepts and utilize the annotations in an inefficient way.

Data and Resources

Cite this as

Jianshu Li, Yidong Li, Jian Zhao, Yunchao Wei, Congyan Lang, Jiashi Feng, Shuicheng Yan, Terence Sim (2024). Dataset: ATR dataset. https://doi.org/10.57702/tj3t36k5

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.1705.07206
Citation
  • https://doi.org/10.48550/arXiv.1904.04536
Author Jianshu Li
More Authors
Yidong Li
Jian Zhao
Yunchao Wei
Congyan Lang
Jiashi Feng
Shuicheng Yan
Terence Sim
Homepage https://ai.stanford.edu/~jgao/papers/ATR.pdf