-
Reddit Comments dataset
The Reddit Comments dataset is constructed from publicly available user comments on submissions on the Reddit website. -
Jewelry Shop Conversational Chatbot Dataset
The dataset used for the jewelry shop conversational chatbot, containing customer queries and responses. -
Alquist 3.0: Alexa Prize Bot Using Conversational Knowledge Graph
The third version of the socialbot Alquist, a conversational system designed to converse coherently and engagingly with humans on popular topics. -
Ubuntu Corpus
The dataset used in the paper is the Ubuntu Corpus, which consists of dialogues from the Ubuntu technical support chat. -
Baidu TieBa Corpus
The dataset used for context-oriented response selecting task, which is considered as a binary classification problem. -
Audrey Dataset
The dataset used for training the models in the Audrey chatbot. -
Audrey: A Personalized Open-Domain Conversational Bot
Audrey is an open-domain conversational chat-bot that aims to engage customers on informational, personal and relational levels through interest-driven conversations guided by... -
Towards a human-like open-domain chatbot
The dataset is used for open-domain human-machine conversation, where the goal is to generate responses to context. -
Building a conversational agent overnight with dialogue self-play
The Building a conversational agent overnight with dialogue self-play dataset is a benchmark for conversational AI. -
ATIS Intent Classification dataset
The dataset used in this paper is a noisy annotated dataset obtained from a zero-shot learner based module. -
Topical-Chat
The Topical-Chat dataset is a knowledge-grounded open-domain conversational dataset, which consists of dialogues between two Mechanical Turk workers (a.k.a. Turkers). -
Reddit conversation corpus
Reddit conversation corpus, consisting of data extracted from 95 top-ranked subreddits that discuss various topics such as sports, news, education and politics. -
Few-Shot Personalized Conversation Systems via Social Networks
A few-shot personalized conversation task with an auxiliary social network, catering for low-resource speakers. -
Conversational dataset
The conversational dataset is used to evaluate the performance of the proposed algorithms. The dataset consists of 20,000 questions and answers, where each question is answered... -
Empathetic Dialogue dataset
The Empathetic Dialogue dataset is a dataset of conversations related to daily life, each with an emotion label, a situation described in text, and a short two-party dialogue. -
SpeechBrain 1.0
SpeechBrain is an open-source Conversational AI toolkit based on PyTorch, focused particularly on speech processing tasks such as speech recognition, speech enhancement, speaker... -
Image-Chat: Engaging Grounded Conversations
Image-Chat dataset -
Reddit Conversation dataset
Reddit Conversation dataset