Image Description - Groups

Flickr8K

Flickr8K dataset is a collection of 8,000 images with 5 sentences describing each image content

Dataset
JSON

Deep RNN and Memory Cells for Image Features

Generating natural language descriptions for images is a challenging task. The traditional way is to use the convolutional neural network (CNN) to extract image features,...

Dataset
JSON

Twitter A11y

The dataset used in the paper to evaluate the effectiveness of Twitter accessibility.

Dataset
JSON

Framing Image Description as a Ranking Task

A dataset for evaluating the performance of image description models.

Dataset
JSON

COCO Captions and Localized Narratives

The dataset used in the paper is COCO captions and Localized Narratives, which are used to generate image descriptions.

Dataset
JSON

Flickr30k

The Flickr30k dataset is widely utilized for image caption and image-text retrieval tasks, providing a substantial collection of images with associated captions.

Dataset
JSON

DTD

Texture classiﬁcation is an important and challenging problem in many image processing applications.

Dataset
JSON

Visual Genome

The Visual Genome dataset is a large-scale visual question answering dataset, containing 1.5 million images, each with 15-30 annotated entities, attributes, and relationships.

Dataset
JSON

COCO

Large scale datasets [18, 17, 27, 6] boosted text conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...

Dataset
JSON

MSCOCO

Human Pose Estimation (HPE) aims to estimate the position of each joint point of the human body in a given image. HPE tasks support a wide range of downstream tasks such as...

Dataset
JSON

10 datasets found