Dataset - LDM

HAIM-MIMIC-MM

Multimodal clinical dataset for healthcare applications
- Dataset
- JSON
Generalized K-fan Multimodal Deep Model with Shared Representations

Multimodal learning with deep Boltzmann machines (DBMs) is an generative approach to fuse multimodal inputs, and can learn the shared representation via Contrastive Divergence...
- Dataset
- JSON
Multimodal WBC dataset for WBC classification

A multimodal WBC dataset for WBC classification, consisting of four modalities and five classes.
- Dataset
- JSON
AVEC2019 DDS

AVEC2019 DDS is a benchmark dataset for depression detection.
- Dataset
- JSON
CubeMLP: An MLP-based Model for Multimodal Sentiment Analysis and Depression ...

Multimodal sentiment analysis and depression estimation are two important research topics that aim to predict human mental states using multimodal data.
- Dataset
- JSON
MUTE

The MUTE dataset is a multimodal dataset for detecting hateful memes.
- Dataset
- JSON
MemoSEN

The MemoSEN dataset is a multimodal dataset for sentiment analysis of Bengali memes.
- Dataset
- JSON
LLaVA-Instruct-150k

Visual question answering dataset
- Dataset
- JSON
ReasonDet

Reasoning detection dataset for multimodal large language models
- Dataset
- JSON
BabelPic

The BabelPic dataset is a multimodal dataset for non-concrete concepts.
- Dataset
- JSON
Epic Kitchens dataset

Epic Kitchens dataset is a dataset for egocentric vision and action recognition.
- Dataset
- JSON
SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets

Geospatial datasets are diverse, naturally spatiotemporal, and inherently multimodal (composed of two or more distinct signal types or modalities) e.g., satellite/aerial imagery...
- Dataset
- JSON
Car Pedestrian Interaction (CPI) dataset

The authors present a synthetic Car Pedestrian Interaction (CPI) dataset for evaluating multimodal future predictions.
- Dataset
- JSON
ReasonSeg

The ReasonSeg dataset is a benchmark for reasoning segmentation tasks, which demands a nuanced comprehension of intricate queries to accurately pinpoint object regions.
- Dataset
- JSON
BraTS 2019 validation and testing datasets

The BraTS 2019 validation and testing datasets are used to evaluate the performance of the proposed segmentation method.
- Dataset
- JSON
BraTS 2019 training dataset

Multimodal brain tumor segmentation challenge (BraTS) aims to evaluate state-of-the-art methods for the segmentation of brain tumors by providing a 3D MRI dataset with ground...
- Dataset
- JSON
MulRan: Multimodal Range Dataset for Urban Place Recognition

The MulRan dataset is a multimodal range dataset for urban place recognition, containing data collected from a scanning Navtech radar in various weather conditions.
- Dataset
- JSON
NTU-RGBD

The NTU-RGBD dataset is a large-scale dataset for 3D human activity analysis, containing 56,000 videos and 60 actions performed by 40 people from 80 different views.
- Dataset
- JSON
Multimodal Meme Dataset (MultiOFF)

Multimodal meme dataset (MultiOFF) for identifying offensive content in image and text.
- Dataset
- JSON
Align before Attend: Aligning Visual and Textual Features for Multimodal Hate...

Multimodal hateful content detection is a challenging task that requires complex reasoning across visual and textual modalities.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

43 datasets found