-
HAIM-MIMIC-MM
Multimodal clinical dataset for healthcare applications -
Generalized K-fan Multimodal Deep Model with Shared Representations
Multimodal learning with deep Boltzmann machines (DBMs) is a generative approach to fusing multimodal inputs, and can learn the shared representation via Contrastive Divergence... -
Multimodal WBC dataset for WBC classification
A multimodal WBC dataset for WBC classification, consisting of four modalities and five classes. -
AVEC2019 DDS
AVEC2019 DDS is a benchmark dataset for depression detection. -
CubeMLP: An MLP-based Model for Multimodal Sentiment Analysis and Depression ...
Multimodal sentiment analysis and depression estimation are two important research topics that aim to predict human mental states using multimodal data. -
LLaVA-Instruct-150k
Visual question answering dataset -
Epic Kitchens dataset
The Epic Kitchens dataset targets egocentric vision and action recognition. -
SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets
Geospatial datasets are diverse, naturally spatiotemporal, and inherently multimodal (composed of two or more distinct signal types or modalities) e.g., satellite/aerial imagery... -
Car Pedestrian Interaction (CPI) dataset
The authors present a synthetic Car Pedestrian Interaction (CPI) dataset for evaluating multimodal future predictions. -
BraTS 2019 validation and testing datasets
The BraTS 2019 validation and testing datasets are used to evaluate the performance of the proposed segmentation method. -
BraTS 2019 training dataset
Multimodal brain tumor segmentation challenge (BraTS) aims to evaluate state-of-the-art methods for the segmentation of brain tumors by providing a 3D MRI dataset with ground... -
MulRan: Multimodal Range Dataset for Urban Place Recognition
The MulRan dataset is a multimodal range dataset for urban place recognition, containing data collected from a scanning Navtech radar in various weather conditions. -
Multimodal Meme Dataset (MultiOFF)
Multimodal meme dataset (MultiOFF) for identifying offensive content in image and text. -
Align before Attend: Aligning Visual and Textual Features for Multimodal Hate...
Multimodal hateful content detection is a challenging task that requires complex reasoning across visual and textual modalities.