-
Improving open-domain dialogue systems via multi-turn incomplete utterance restoration. -
Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomple...
Incomplete utterance rewriting has recently raised wide attention. However, previous works do not consider the semantic structural information between incomplete utterance and... -
Stanford Multi-turn, Multi-domain Dialogue Dataset
The Stanford Multi-turn, Multi-domain Dialogue Dataset is a dataset for language understanding in task-oriented dialogue systems. It contains a large number of training... -
Airline Travel Information System dataset (ATIS)
The Airline Travel Information System dataset (ATIS) is a dataset for language understanding in task-oriented dialogue systems. It contains 4978 training utterances from Class A... -
Topical-Chat
The Topical-Chat dataset is a knowledge-grounded open-domain conversational dataset, which consists of dialogues between two Mechanical Turk workers (a.k.a. Turkers). -
Grounded response generation task at DSTC7 -
Schema-Guided Dialogue
The Schema-Guided Dialogue (SGD) dataset contains over 20,000 multi-domain conversations between a human and a virtual assistant. -
MultiWOZ 2.0 and MultiWOZ 2.1
Dialogue state tracking (DST) aims at estimating the current dialogue state given all the preceding conversation. For multi-domain DST, the data sparsity problem is a major... -
Cambridge restaurant domain
The dataset used in the paper is the Cambridge restaurant domain from the PyDial toolkit. -
Semantically conditioned dialog response generation via hierarchical disentangled self-attention. -
CRWIZ: A Framework for Crowdsourcing Real-Time Wizard-of-Oz Dialogues
A crowdsourced dialogue dataset for emergency response tasks, using a Wizard-of-Oz approach. -
Switchboard dataset
The dataset used in the paper is the Switchboard dataset, which contains telephone conversations. -
Persona-Chat
Persona-Chat is a crowdsourced chit-chat dialogue dataset in which each speaker is assigned a short persona profile and converses in character. -
DailyDialog
The DailyDialog dataset is a multi-turn dialogue dataset of 13,118 human-written conversations about daily life, averaging roughly 8 turns each.