Dataset - LDM

XLNet: Generalized Autoregressive Pretraining for Language Understanding

XLNet is a generalized autoregressive pretraining model for language understanding.
- Dataset
- JSON
Stanford Multi-turn, Multi-domain Dialogue Dataset

The Stanford Multi-turn, Multi-domain Dialogue Dataset is a dataset for language understanding in task-oriented dialogue systems. It contains a large number of training...
- Dataset
- JSON
Airline Travel Information System dataset (ATIS)

The Airline Travel Information System dataset (ATIS) is a dataset for language understanding in task-oriented dialogue systems. It contains 4978 training utterances from Class A...
- Dataset
- JSON
Ref-DAVIS17

Ref-DAVIS17 extends the DAVIS17 dataset with language descriptions for each specific object in the videos.
- Dataset
- JSON
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Ob...

Referring video object segmentation (RVOS) aims to accurately segment the target object in the video with the guidance of given language expressions.
- Dataset
- JSON
MMLU

The dataset is used for instruction-tuning of LLMs in multiple languages using reinforcement learning from human feedback.
- Dataset
- JSON
C4

C4 is a dataset used for pre-training language models, containing a large collection of text documents.
- Dataset
- JSON
G-Ref

G-Ref is a dataset for referring image segmentation, comprising 104K referring language expressions for around 55K objects in about 27K images.
- Dataset
- JSON
MSR-VTT: A Large Video Description Dataset for Bridging Video and Language

MSR-VTT is a large video description dataset for bridging video and language.
- Dataset
- JSON
Ref-Youtube-VOS

Ref-Youtube-VOS is an extensive referring video object segmentation dataset that comprises approximately 15,000 referring expressions associated with more than 3,900 videos.
- Dataset
- JSON
RefCOCO

RefCOCO is a benchmark for referring expression grounding, containing 142,210 referring expressions for 50,000 referents in 19,994 images.
- Dataset
- JSON
BERT: Pre-training of deep bidirectional transformers for language understanding

This paper proposes BERT, a pre-trained deep bidirectional transformer for language understanding.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

12 datasets found