-
Wired Explaining Dialogue Corpus
A manually annotated corpus for studying how humans explain in dialogical settings -
DailyDialog: A Manually Labelled Multi-Turn Dialogue Dataset
DailyDialog: A manually labelled multi-turn dialogue dataset. -
IEMOCAP dataset
The IEMOCAP dataset contains five recording sessions, each with one male speaker and one female speaker. -
Learning to speak and act in a fantasy text adventure game
A dataset of text-adventure game dialogues, including fantasy and horror games. -
Dialogue State Tracking Challenge 4 (DSTC4)
The proposed model proposes an end-to-end attentional role-based contextual model that automatically learns speaker-specific contextual encoding and investigates various... -
Google Assistant Dataset
The dataset used in this paper is a large-scale conversation dataset, generated by human evaluators, with a total of 20K conversations. -
Doctor-Patient Conversations Corpus
The dataset used in this paper is a corpus of nearly 7,000 doctor-patient conversations. -
Dialogue Dataset for Detecting Sentences that Do Not Require Factual Correctn...
A dialogue dataset annotated with fact-check-needed label (DDFC) for detecting sentences that do not require factual correctness judgment -
DBDC4 Japanese
The DBDC4 Japanese dataset contains dialogues from three dialogue systems named DCM, DIT, and IRS, and five other dialogue systems (IRS, MMK, MRK, TRF, and ZNK) which... -
Ubuntu Dialogue Corpus (UDC)
The Ubuntu Dialogue Corpus (UDC) dataset was extracted from the Ubuntu Relay Chat Channel. Although the topics in the dataset are not as diverse as in the MTC, the dataset is... -
Movie Triples Corpus (MTC)
The Movie Triples Corpus (MTC) dataset was derived from the Movie-DiC dataset by Banchs (2012). Although this dataset spans a wide range of topics with few spelling mistakes,... -
MultiWOZ-DF
A dataflow implementation of the MultiWOZ dataset -
Generation-based Conversation Dataset
The dataset used in the paper is another dataset containing 1.6 million query-reply pairs for generation-based conversation systems. -
Retrieval-based Conversation Dataset
The dataset used in the paper is a large database of query-reply pairs for retrieval-based conversation systems, containing 7 million query-reply pairs.