-
Improving open-domain dialogue systems via multi-turn incomplete utterance re...
Improving open-domain dialogue systems via multi-turn incomplete utterance restoration. -
Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomple...
Incomplete utterance rewriting has recently raised wide attention. However, previous works do not consider the semantic structural information between incomplete utterance and... -
Stanford Multi-turn, Multi-domain Dialogue Dataset
The Stanford Multi-turn, Multi-domain Dialogue Dataset is a dataset for language understanding in task-oriented dialogue systems. It contains a large number of training... -
Airline Travel Information System dataset (ATIS)
The Airline Travel Information System dataset (ATIS) is a dataset for language understanding in task-oriented dialogue systems. It contains 4978 training utterances from Class A... -
Topical-Chat
The Topical-Chat dataset is a knowledge-grounded open-domain conversational dataset, which consists of dialogues between two Mechanical Turk workers (a.k.a. Turkers). -
Grounded response generation task at DSTC7
Grounded response generation task at DSTC7 -
Schema-Guided Dialogue
The Schema-Guided Dialogue (SGD) dataset contains over 20,000 multi-domain conversations between a human and a virtual assistant. -
Empathetic Dialogue dataset
The Empathetic Dialogue dataset is a dataset of conversations related to daily life, each with an emotion label, a situation described in text, and a short two-party dialogue. -
MultiWOZ 2.0 and MultiWOZ 2.1
Dialogue state tracking (DST) aims at estimating the current dialogue state given all the preceding conversation. For multi-domain DST, the data sparsity problem is a major... -
Cambridge restaurant domain
The dataset used in the paper is the Cambridge restaurant domain from the PyDial toolkit. -
Semantically conditioned dialog response generation via hierarchical disentan...
Semantically conditioned dialog response generation via hierarchical disentangled self-attention. -
Textual Interface Driven Task-Oriented Dialogue Systems
Traditional end-to-end task-oriented dialogue systems have been built with a modularized design. However, such design often causes misalignment between the agent response and... -
CRWIZ: A Framework for Crowdsourcing Real-Time Wizard-of-Oz Dialogues
A crowdsourced dialogue dataset for emergency response tasks, using a Wizard-of-Oz approach. -
Switchboard Corpus
The Switchboard corpus is a dataset of speech recordings from a switchboard, which is a device that allows multiple people to speak at the same time. -
Switchboard dataset
The dataset used in the paper is the Switchboard dataset, which contains telephone conversations. -
Switchboard
Human speech data comprises a rich set of domain factors such as accent, syntactic and semantic variety, or acoustic environment.