-
RedPajama Dataset
The RedPajama dataset is used for single-turn dialogue task. -
Topical-Chat
The Topical-Chat dataset is a knowledge-grounded open-domain conversational dataset, which consists of dialogues between two Mechanical Turk workers (a.k.a. Turkers). -
SummEval and Topical-Chat
This paper uses SummEval and Topical-Chat datasets for evaluating the quality of summaries and responses. -
E-commerce Dialogue Corpus
The dataset is used for training and testing response selection models for multi-turn conversations. -
Douban Conversation Corpus
The dataset is used for training and testing response selection models for multi-turn conversations. -
Multi-Turn Dialogue Reasoning
A dataset for multi-turn dialogue reasoning -
DialogConv: A Lightweight Fully Convolutional Network for Multi-view Response...
A lightweight fully convolutional network for multi-view response selection -
EmpatheticDialogues
The EmpatheticDialogues dataset is a text dataset for training empathetic AI chatbots, consisting of 25k conversations grounded in emotional situations with emotion labels. -
Ubuntu Dialogue Corpus
The Ubuntu Dialogue Corpus is the largest freely available multi-turn based dialogue corpus which consists of almost one million two-way conversations extracted from the Ubuntu...