-
DailyDialog: A Manually Labelled Multi-Turn Dialogue Dataset
DailyDialog: A manually labelled multi-turn dialogue dataset. -
IEMOCAP dataset
The IEMOCAP dataset contains five recording sessions, each with one male speaker and one female speaker. -
MultiWOZ dataset
The dataset used in the paper is the MultiWOZ dataset, which is a human-human task-oriented dialogue dataset collected via the Wizard-of-Oz framework. It contains conversations... -
Taskmaster
Task-oriented dialogue datasets for training and evaluation of task-oriented dialogue models -
MultiWOZ 2.0, CamRest676, SMCalFlow
Task-oriented dialogue datasets for training and evaluation of task-oriented dialogue models -
Multi-Session Chat
The Multi-Session Chat dataset is used in the paper for evaluating the long-term memory of conversational agents. -
DBDC4 Japanese
The DBDC4 Japanese dataset contains dialogues from three dialogue systems named DCM, DIT, and IRS, and five other dialogue systems (IRS, MMK, MRK, TRF, and ZNK) which... -
MultiWOZ-DF
A dataflow implementation of the MultiWOZ dataset -
Improving open-domain dialogue systems via multi-turn incomplete utterance re...
Improving open-domain dialogue systems via multi-turn incomplete utterance restoration. -
Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomple...
Incomplete utterance rewriting has recently raised wide attention. However, previous works do not consider the semantic structural information between incomplete utterance and... -
Stanford Multi-turn, Multi-domain Dialogue Dataset
The Stanford Multi-turn, Multi-domain Dialogue Dataset is a dataset for language understanding in task-oriented dialogue systems. It contains a large number of training... -
Airline Travel Information System dataset (ATIS)
The Airline Travel Information System dataset (ATIS) is a dataset for language understanding in task-oriented dialogue systems. It contains 4978 training utterances from Class A... -
Topical-Chat
The Topical-Chat dataset is a knowledge-grounded open-domain conversational dataset, which consists of dialogues between two Mechanical Turk workers (a.k.a. Turkers). -
Grounded response generation task at DSTC7
Grounded response generation task at DSTC7 -
Schema-Guided Dialogue
The Schema-Guided Dialogue (SGD) dataset contains over 20,000 multi-domain conversations between a human and a virtual assistant. -
Empathetic Dialogue dataset
The Empathetic Dialogue dataset is a dataset of conversations related to daily life, each with an emotion label, a situation described in text, and a short two-party dialogue.