Dataset - LDM

ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Di...

Image-grounded dialogue generation in zero-resource scenarios
- Dataset
- JSON

Mutan: Multimodal Tucker Fusion for Visual Question Answering

The dataset used in the paper is a collection of images and corresponding referring expressions.
- Dataset
- JSON

Multimodal Information Fusion for Urban Scene Understanding

A dataset for urban scene understanding.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).
