Dataset - LDM

NeurIPS, AAN, NSF Abstracts

NeurIPS, AAN, NSF Abstracts
- Dataset
- JSON
The Pile

The Pile dataset contains 3.5 million samples of diverse text for language modeling.
- Dataset
- JSON
Event Location Dataset

A dataset of around 8,000 labeled sentences in English, each of which is annotated with an event verb and its corresponding location or locations.
- Dataset
- JSON
FLICKR-25K

The dataset used for cross-modal hashing task, containing image and text data.
- Dataset
- JSON
Wiki4

The dataset used for cross-modal hashing task, containing image and text data.
- Dataset
- JSON
MoMu

The MoMu dataset is a molecular graph-text pairs dataset, constructed from scientific articles.
- Dataset
- JSON
MoleculeNet

The MoleculeNet dataset is a collection of molecular property prediction tasks. It contains 17 datasets, each with a different type of molecular graph.
- Dataset
- JSON
Trafﬁcking-10k

The Trafﬁcking-10k dataset contains more than 10,000 advertisements annotated for the task of detecting human trafﬁcking. The dataset contains two sources of information per...
- Dataset
- JSON
AudioCaps

Audio-text retrieval aims at retrieving a target audio clip or caption from a pool of candidates given a query in another modality.
- Dataset
- JSON
LSMDC

The LSMDC movie description dataset consists of 118,081 short video clips extracted from 202 movies, each annotated with a single caption.
- Dataset
- JSON
MSVD

Text-Video Retrieval (TVR) aims to align relevant video content with natural language queries. To date, most state-of-the-art TVR methods learn image-to-video transfer learning...
- Dataset
- JSON
ActivityNet Captions

The ActivityNet Captions is a benchmark dataset proposed for dense video captioning. There are 20K untrimmed videos in total, and each video has several annotated segments with...
- Dataset
- JSON
MSR-VTT

The dataset used in the paper is MSR-VTT, a large video description dataset for bridging video and language. The dataset contains 10k video clips with length varying from 10 to...
- Dataset
- JSON
MMVet Dataset

The dataset used for testing the Vary-base model, containing MMVet dataset.
- Dataset
- JSON
DocVQA and ChartQA Datasets

The dataset used for testing the Vary-base model, containing DocVQA and ChartQA datasets.
- Dataset
- JSON
Document-Level OCR Dataset

The dataset used for testing the Vary-base model, containing document-level OCR test set.
- Dataset
- JSON
Natural Image-Text Dataset

The dataset used for training the Vary-base model, containing natural image-text pairs.
- Dataset
- JSON
Document and Chart Dataset

The dataset used for training the new vision vocabulary network, containing high-resolution document and chart images with corresponding text.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

18 datasets found