-
Jewelry Shop Conversational Chatbot Dataset
The dataset used for the jewelry shop conversational chatbot, containing customer queries and responses. -
Ubuntu Corpus
The dataset used in the paper is the Ubuntu Corpus, which consists of dialogues from the Ubuntu technical support chat. -
Baidu TieBa Corpus
The dataset used for context-oriented response selecting task, which is considered as a binary classification problem. -
Audrey Dataset
The dataset used for training the models in the Audrey chatbot. -
Audrey: A Personalized Open-Domain Conversational Bot
Audrey is an open-domain conversational chat-bot that aims to engage customers on informational, personal and relational levels through interest-driven conversations guided by... -
Towards a human-like open-domain chatbot
The dataset is used for open-domain human-machine conversation, where the goal is to generate responses to context. -
Building a conversational agent overnight with dialogue self-play
The Building a conversational agent overnight with dialogue self-play dataset is a benchmark for conversational AI. -
Topical-Chat
The Topical-Chat dataset is a knowledge-grounded open-domain conversational dataset, which consists of dialogues between two Mechanical Turk workers (a.k.a. Turkers). -
Reddit conversation corpus
Reddit conversation corpus, consisting of data extracted from 95 top-ranked subreddits that discuss various topics such as sports, news, education and politics. -
ChatGPT: A conversational AI model
The dataset used in the paper ChatGPT: A conversational AI model. -
ConvAI2 persona-chat dataset
The ConvAI2 persona-chat dataset is an extended version of the persona-chat dataset, which contains conversations obtained from crowdworkers who were randomly paired and asked... -
OpenAssistant
The authors used the OpenAssistant dataset to construct evaluation datasets for their attacks. -
DailyDialog
The DailyDialog dataset is a large-scale multi-turn dialogue dataset, consisting of 10,000 conversations with 5 turns each. -
EmpatheticDialogues
The EmpatheticDialogues dataset is a text dataset for training empathetic AI chatbots, consisting of 25k conversations grounded in emotional situations with emotion labels.