Dataset - LDM

Multimodal Learning Task

The dataset used in the paper supports a multimodal learning task for robots.
- Dataset
- JSON
Multimodal Categorization Task

The dataset used in the paper supports a multimodal categorization task using image data and speech signals.
- Dataset
- JSON
DEAP

A large-scale city-wise dataset for exploring the relationships among air pollutants and their causal agents over time.
- Dataset
- JSON
MAMI dataset

The MAMI dataset is a collection of images and text posts used for training and testing the proposed multimodal model for misogyny identification.
- Dataset
- JSON
XD-Violence

The XD-Violence dataset is a large-scale multimodal video dataset for violence detection. It consists of 4,754 untrimmed videos with a total duration of 217 hours, covering six...
- Dataset
- JSON
Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence ...

Weakly supervised multimodal violence detection aims to learn a violence detection model by leveraging multiple modalities such as RGB, optical flow, and audio, while only...
- Dataset
- JSON
VQA

The VQA dataset is a large-scale visual question answering dataset that consists of images paired with questions that require natural language answers.
- Dataset
- JSON
DeepFashion Multimodal dataset

The DeepFashion Multimodal dataset contains 12,701 full-body images in 24 categories.
- Dataset
- JSON
BraTS 2020 Challenge

The BraTS 2020 challenge dataset is a multimodal MRI brain tumor segmentation dataset. It contains 369 subjects with 4 MRI modalities (T2 weighted FLAIR, T1 weighted, T1...
- Dataset
- JSON
BraTS 2020

Automatic segmentation of brain tumors is an essential but challenging step for extracting quantitative imaging biomarkers for accurate tumor detection, diagnosis, prognosis,...
- Dataset
- JSON
Stanford Drone Dataset (SDD)

The Stanford Drone Dataset (SDD) is a large-scale dataset that consists of 60 aerial-view videos captured by drones over Stanford University. SDD contains positions of more than...
- Dataset
- JSON
MMAUD

Multimodal anti-UAV dataset for modern miniature drone threats
- Dataset
- JSON
MIMIC-CXR-JPG

The MIMIC-CXR-JPG dataset comprises 227,835 imaging studies conducted on 64,588 patients who sought treatment at the BIDMC Emergency Department from 2011 to 2016.
- Dataset
- JSON
LADI-VTON

LADI-VTON: Latent diffusion textual-inversion enhanced model for virtual try-on
- Dataset
- JSON
Multimodal Garment Designer

Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
- Dataset
- JSON
FashionSD-X

FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion
- Dataset
- JSON
RAVDESS

The RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) dataset features 24 professional actors (12 female, 12 male) who offer performances of good quality and...
- Dataset
- JSON
Stanford Alpaca

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used CIFAR-10 and CIFAR-100 datasets for image classification, and ImageNet-100...
- Dataset
- JSON
MineCLIP

The MineCLIP dataset is a large-scale dataset of Minecraft demonstrations.
- Dataset
- JSON
GenRL

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a combination of reinforcement learning and generative models to solve...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

43 datasets found