-
Reddit Comments dataset
The Reddit Comments dataset is constructed from publicly available user comments on submissions on the Reddit website. -
ChatGPT: A conversational AI model
The dataset used in the paper ChatGPT: A conversational AI model. -
E-commerce Dialogue Corpus
The dataset is used for training and testing response selection models for multi-turn conversations. -
Douban Conversation Corpus
The dataset is used for training and testing response selection models for multi-turn conversations. -
DailyDialog
The DailyDialog dataset is a large-scale multi-turn dialogue dataset, consisting of 10,000 conversations with 5 turns each. -
EmpatheticDialogues
The EmpatheticDialogues dataset is a text dataset for training empathetic AI chatbots, consisting of 25k conversations grounded in emotional situations with emotion labels. -
Ubuntu Dialogue Corpus
The Ubuntu Dialogue Corpus is the largest freely available multi-turn based dialogue corpus which consists of almost one million two-way conversations extracted from the Ubuntu...