13 datasets found

Groups: Conversational AI

Filter Results
  • PARRY

    A chatbot designed by Colby et al. to imitate aggressive emotions.
  • mT5

    A multilingual version of the seq2seq architecture trained on Colossal Clean Crawled Corpus.
  • RetGen

    A hybrid retrieval-augmented/grounded version of the seq2seq architecture.
  • DLGNet

    A multi-turn dialogue response generator that was evaluated using automatic metrics.
  • Meena

    A multi-turn open-domain conversational AI seq2seq model that was trained end-to-end.
  • Reddit Comments dataset

    The Reddit Comments dataset is constructed from publicly available user comments on submissions on the Reddit website.
  • DICES-350

    The DICES-350 dataset is a curated sample of 8k multi-turn conversation corpus generated by human agents interacting with a generative AI-chatbot (Thoppilan et al., 2022) in an...
  • ChatGPT: A conversational AI model

    The dataset used in the paper ChatGPT: A conversational AI model.
  • E-commerce Dialogue Corpus

    The dataset is used for training and testing response selection models for multi-turn conversations.
  • Douban Conversation Corpus

    The dataset is used for training and testing response selection models for multi-turn conversations.
  • DailyDialog

    The DailyDialog dataset is a large-scale multi-turn dialogue dataset, consisting of 10,000 conversations with 5 turns each.
  • EmpatheticDialogues

    The EmpatheticDialogues dataset is a text dataset for training empathetic AI chatbots, consisting of 25k conversations grounded in emotional situations with emotion labels.
  • Ubuntu Dialogue Corpus

    The Ubuntu Dialogue Corpus is the largest freely available multi-turn based dialogue corpus which consists of almost one million two-way conversations extracted from the Ubuntu...