Dialogue Systems - Groups

DIASPORA

The DIASPORA dataset is a human-human conversation dataset annotated with lexical aspect.

Dataset
JSON

Wired Explaining Dialogue Corpus

A manually annotated corpus for studying how humans explain in dialogical settings

Dataset
JSON

DialogZoo

A large-scale dialogue dataset with rich task diversity, collected to pre-train a unified dialogue foundation model.

Dataset
JSON

DailyDialog: A Manually Labelled Multi-Turn Dialogue Dataset

DailyDialog: A manually labelled multi-turn dialogue dataset.

Dataset
JSON

IEMOCAP dataset

The IEMOCAP dataset contains five recording sessions, each with one male speaker and one female speaker.

Dataset
JSON

GoRecDial

A conversational recommendation dataset released by Kang et al. This dataset was constructed using ParlAI to interface with Amazon Mechanical Turk (AMT) to reflect the movie...

Dataset
JSON

Learning to speak and act in a fantasy text adventure game

A dataset of text-adventure game dialogues, including fantasy and horror games.

Dataset
JSON

Dialogue State Tracking Challenge 4 (DSTC4)

The proposed model proposes an end-to-end attentional role-based contextual model that automatically learns speaker-speciﬁc contextual encoding and investigates various...

Dataset
JSON

Google Assistant Dataset

The dataset used in this paper is a large-scale conversation dataset, generated by human evaluators, with a total of 20K conversations.

Dataset
JSON

Doctor-Patient Conversations Corpus

The dataset used in this paper is a corpus of nearly 7,000 doctor-patient conversations.

Dataset
JSON

KVRET*

Dialogue contexts are proven helpful in the spoken language understanding (SLU) system and they are typically encoded with explicit memory representations. However, most of the...

Dataset
JSON

KVRET

Dialogue contexts are proven helpful in the spoken language understanding (SLU) system and they are typically encoded with explicit memory representations. However, most of the...

Dataset
JSON

Dialogue Dataset for Detecting Sentences that Do Not Require Factual Correctn...

A dialogue dataset annotated with fact-check-needed label (DDFC) for detecting sentences that do not require factual correctness judgment

Dataset
JSON

DBDC4 Japanese

The DBDC4 Japanese dataset contains dialogues from three dialogue systems named DCM, DIT, and IRS, and five other dialogue systems (IRS, MMK, MRK, TRF, and ZNK) which...

Dataset
JSON

DBDC4

The Fourth Dialogue Breakdown Detection Challenge (DBDC4) dataset contains dialogues from a dialogue system named IRIS and six other dialogue systems (anonymised as Bot001 to...

Dataset
JSON

Ubuntu Dialogue Corpus (UDC)

The Ubuntu Dialogue Corpus (UDC) dataset was extracted from the Ubuntu Relay Chat Channel. Although the topics in the dataset are not as diverse as in the MTC, the dataset is...

Dataset
JSON

Movie Triples Corpus (MTC)

The Movie Triples Corpus (MTC) dataset was derived from the Movie-DiC dataset by Banchs (2012). Although this dataset spans a wide range of topics with few spelling mistakes,...

Dataset
JSON

MultiWOZ-DF

A dataflow implementation of the MultiWOZ dataset

Dataset
JSON

Generation-based Conversation Dataset

The dataset used in the paper is another dataset containing 1.6 million query-reply pairs for generation-based conversation systems.

Dataset
JSON

Retrieval-based Conversation Dataset

The dataset used in the paper is a large database of query-reply pairs for retrieval-based conversation systems, containing 7 million query-reply pairs.

Dataset
JSON

81 datasets found