ATR dataset

doi:10.57702/tj3t36k5

Human parsing has recently attracted considerable interest and made great progress with the advance of deep convolutional neural networks and large-scale datasets. Most prior works focus on developing new structures and auxiliary information guidance to improve general feature representation, such as dilated convolution, LSTM structure, encoder-decoder architecture, and human pose constraints. Although these methods show promising results on each human parsing dataset, they directly use one flat prediction layer to classify all labels, which disregards the intrinsic semantic correlations across concepts and utilizes the annotations inefficiently.

BibTeX:

@dataset{Jianshu_Li_and_Yidong_Li_and_Jian_Zhao_and_Yunchao_Wei_and_Congyan_Lang_and_Jiashi_Feng_and_Shuicheng_Yan_and_Terence_Sim_2024,
    abstract = {Human parsing has recently attracted considerable interest and made great progress with the advance of deep convolutional neural networks and large-scale datasets. Most prior works focus on developing new structures and auxiliary information guidance to improve general feature representation, such as dilated convolution, LSTM structure, encoder-decoder architecture, and human pose constraints. Although these methods show promising results on each human parsing dataset, they directly use one flat prediction layer to classify all labels, which disregards the intrinsic semantic correlations across concepts and utilizes the annotations inefficiently.},
    author = {Jianshu Li and Yidong Li and Jian Zhao and Yunchao Wei and Congyan Lang and Jiashi Feng and Shuicheng Yan and Terence Sim},
    doi = {10.57702/tj3t36k5},
    institution = {No Organization},
    keyword = {18 labels, human parsing, semantic segmentation, single persons, upright position},
    month = {dec},
    publisher = {TIB},
    title = {ATR dataset},
    url = {https://service.tib.eu/ldmservice/dataset/atr-dataset},
    year = {2024}
}