-
Towards Efficient Dialogue Pre-training with Transferable and Interpretable L...
This paper proposes a novel dialogue model with a latent structure that is easily transferable from the general domain to downstream tasks in a lightweight and transparent way. -
Low-resource knowledge-grounded dialogue generation
The dataset is used for low-resource knowledge-grounded dialogue generation, where the goal is to generate responses to context based on external knowledge. -
Incremental Transformer with Deliberation Decoder for Document Grounded Conve...
The dataset is used for document-grounded conversation, where the goal is to generate responses to context based on external knowledge. -
Topical-Chat: Towards knowledge-grounded open-domain conversations
The dataset is used for knowledge-grounded dialogue generation, where the goal is to generate responses to context based on external knowledge. -
Wizard of Wikipedia: Knowledge-powered conversational agents
The dataset is used for knowledge-grounded dialogue generation, where the goal is to generate responses to context based on external knowledge. -
Zero-Resource Knowledge-Grounded Dialogue Generation
The dataset is used for knowledge-grounded dialogue generation, where the goal is to generate responses to context based on external knowledge. -
OpenSubtitles dataset
Open-domain neural dialogue generation (Vinyals and Le, 2015; Sordoni et al., 2015; Li et al., 2016a; Mou et al., 2016; Serban et al., 2016a; Asghar et al., 2016; Mei et al.,... -
Wizard of Wikipedia
Wizard of Wikipedia is a recent, large-scale dataset of multi-turn knowledge-grounded dialogues between a “apprentice” and a “wizard”, who has access to information from... -
DailyDialog
The DailyDialog dataset is a large-scale multi-turn dialogue dataset, consisting of 10,000 conversations with 5 turns each. -
Commonsense Conversation Dataset
The Commonsense Conversation Dataset (CCD) is a dialogue generation dataset. -
Anthropic's HH-RLHF and OpenAI's summarization datasets
The dataset used in the paper is the Anthropic's HH-RLHF and OpenAI's summarization datasets.