-
Douban Conversation Corpus
The dataset is used for training and testing response selection models for multi-turn conversations. -
Multi-Turn Dialogue Reasoning
A dataset for multi-turn dialogue reasoning -
DialogConv: A Lightweight Fully Convolutional Network for Multi-view Response...
A lightweight fully convolutional network for multi-view response selection -
ReferItGame
Visual grounding is the task of localizing a language query in an image. The output is often a bounding box as drawn in the yellow color. -
Flickr30K Entities
The Flickr30K Entities dataset consists of 31,783 images each matched with 5 captions. The dataset links distinct sentence entities to image bounding boxes, resulting in 70K... -
Vision-and-Language Navigation
The Vision-and-Language Navigation (VLN) task gives a global natural sentence I = {w0,..., wl} as an instruction, where wi is a word token while the l is the length of the... -
From Detection of Toxic Spans in Online Discussions to Analysis of Toxic-to-C...
The ToxicSpans dataset is a subset of the Civil Comments dataset, containing toxic spans. -
Hate Speech Detection using Large Language Models
The dataset used for probing LLMs for hate speech detection, including HateXplain, implicit hate, and ToxicSpans datasets. -
Visual instruction tuning
Visual instruction tuning. -
Flamingo: a visual language model for few-shot learning
Flamingo: a visual language model for few-shot learning. -
Audio-visual scene-aware dialog
Audio-visual scene-aware dialog. -
ChatBridge
ChatBridge is a multimodal language model capable of perceiving real-world multimodal information, as well as following instructions, thinking, and interacting with humans in... -
ClinicalLab: A Comprehensive Clinical Diagnosis Agent Alignment Suite
Large language models (LLMs) have achieved significant performance progress in various natural language processing applications. However, LLMs still struggle to meet the strict... -
TruthX: Alleviating Hallucinations by Editing Large Language Models
TruthX: Alleviating Hallucinations by Editing Large Language Models -
StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure
This work presents StrAE: a Structured Autoencoder framework that through strict adherence to explicit structure, and use of a novel contrastive objective over tree-structured... -
ShapeNeRF–Text
The ShapeNeRF–Text dataset consists of 40K paired NeRFs and language annotations for ShapeNet objects. -
Wikitext-2
The dataset used in this paper is not explicitly described. However, it is mentioned that the authors used the Wikitext-2 dataset for text generation tasks.