-
ChatGPT: A conversational AI model
The dataset used in the paper ChatGPT: A conversational AI model. -
Mutual: A dataset for multi-turn dialogue reasoning
A dataset for multi-turn dialogue reasoning. -
Enhancing chat language models by scaling high-quality instructional conversa...
Enhancing chat language models by scaling high-quality instructional conversations. -
Polaris: A Safety-focused LLM Constellation for Healthcare
The Polaris dataset is a collection of conversations between a patient and a healthcare agent, with the goal of developing a safety-focused Large Language Model (LLM)... -
Chatbot Arena
The dataset used in this paper is a large-scale dataset for evaluating LLMs, which is used to train and evaluate the Chatbot Arena model. -
Arena-Hard
The dataset used in this paper is a large-scale dataset for evaluating LLMs, which is used to train and evaluate the Arena-Hard model. -
LMSYS ChatBot Arena
The dataset used in this paper is a large-scale real-world LLM conversation dataset, which is used to train and evaluate the LMSYS ChatBot Arena model. -
WizardArena
The dataset used in this paper is a large-scale conversational data, which is used to train and evaluate the WizardLM-β model. -
OpenAssistant Conversations– Democratizing Large Language Model Alignment
OpenAssistant Conversations– Democratizing Large Language Model Alignment -
E-commerce Dialogue Corpus
The dataset is used for training and testing response selection models for multi-turn conversations. -
Douban Conversation Corpus
The dataset is used for training and testing response selection models for multi-turn conversations. -
ConvAI2 persona-chat dataset
The ConvAI2 persona-chat dataset is an extended version of the persona-chat dataset, which contains conversations obtained from crowdworkers who were randomly paired and asked... -
User Reported Scenarios (URS) dataset
The User Reported Scenarios (URS) dataset is a collection of real-world use cases with 15 LLMs from a user study with 712 participants from 23 countries. -
Wizard of Wikipedia
Wizard of Wikipedia is a recent, large-scale dataset of multi-turn knowledge-grounded dialogues between a “apprentice” and a “wizard”, who has access to information from... -
STC dataset
The STC dataset is a short text conversation dataset used for evaluating the performance of conversation response generation models. -
OpenAssistant
The authors used the OpenAssistant dataset to construct evaluation datasets for their attacks. -
SIMMC: Situated Interactive Multi-Modal Conversational Data Collection and Ev...
SIMMC is an extension to ParlAI for multi-modal conversational data collection and system evaluation. It simulates an immersive setup, where crowd workers interact with...