Fashion-MNIST

A binary imbalanced classification dataset with 28 × 28 grayscale images of 10 classes corresponding to fashion products.

Dataset
JSON

MNIST

A binary imbalanced classification dataset with 28 × 28 grayscale images of 10 classes corresponding to digits from 0 to 9.

Dataset
JSON

CVBL video database

CVBL video database for face recognition in videos

Dataset
JSON

Opening Up Minds with Argumentative Dialogues

A dataset of 183 argumentative dialogues about 3 controversial topics: veganism, Brexit and COVID-19 vaccination.

Dataset
JSON

GlyphGAN: Style-Consistent Font Generation Based on Generative Adversarial Ne...

Font generation experiment using GlyphGAN, including legibility, diversity, and style consistency evaluation.

Dataset
JSON

REDS

The proposed StableVSR is built upon a pre-trained Latent Diffusion Model (LDM) for single image super-resolution (SISR). We use Stable Diffusion ×4 Upscaler (SD×4Upscaler). It...

Dataset
JSON

Vimeo-90K

The proposed StableVSR is built upon a pre-trained Latent Diffusion Model (LDM) for single image super-resolution (SISR). We use Stable Diffusion ×4 Upscaler (SD×4Upscaler). It...

Dataset
JSON

Devil in the Number: Towards Robust Multi-modality Data Filter

The dataset used in the paper is a web-scale dataset for training a vision-language model. The dataset contains text-image pairs, and the authors propose a novel filter to...

Dataset
JSON

CamVid

The dataset used in the paper is a pre-trained ResNet-50 classifier, which is used for image synthesis, unpaired image-to-image translation, and feature similarity estimation.

Dataset
JSON

ToyADMOS

ToyADMOS: A dataset of miniature-machine operating sounds for anomalous sound detection

Dataset
JSON

MIMII

A common assumption of novelty detection is that the distributions of both “normal” and “novel” data are static. This, however, is often not the case: for example, scenarios where...

Dataset
JSON

McMaster18

The dataset used in the paper for image deblurring tasks.

Dataset
JSON

Kodak24

Kodak24, a dataset for image denoising.

Dataset
JSON

LLaMA-AdapterV2

LLaMA-AdapterV2: A parameter-efficient visual instruction model for text-image generation.

Dataset
JSON

M2Chat: Empowering VLM for Multimodal LLM Interleaved

M2Chat is a novel unified multimodal LLM framework for generating interleaved text-image conversation across various scenarios.

Dataset
JSON

Hand-drawn Symbol Recognition of Surgical Flowsheet Graphs with Deep Image Se...

The dataset used in this paper for hand-drawn symbol recognition of surgical flowsheet graphs with deep image segmentation.

Dataset
JSON

SFEW

The SFEW dataset is a subset of the EmotiW2015 in-the-wild emotion dataset.

Dataset
JSON

EmotioNet

Label distribution learning on auxiliary label space graphs for facial expression recognition.

Dataset
JSON

AffectNet

The dataset consists of a significant collection of 60,000 facial expression images, categorized into eight different classes, including neutral, happy, angry, sad, fear,...

Dataset
JSON

RAF-DB

Facial Expression Recognition (FER) is a classification task that points to face variants. Hence, there are certain affinity features between facial expressions, receiving lit-

Dataset
JSON

24,167 datasets found