-
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech
The implicit hate dataset is a specialized collection of data aimed at detecting hate speech. -
Hatexplain: A Benchmark Dataset for Explainable Hate Speech Detection
The HateXplain dataset is a benchmark dataset for explainable hate speech detection. -
Hate Speech Detection using Large Language Models
The dataset used for probing LLMs for hate speech detection, including HateXplain, implicit hate, and ToxicSpans datasets. -
Visual instruction tuning
Visual instruction tuning. -
Flamingo: a visual language model for few-shot learning
Flamingo: a visual language model for few-shot learning. -
Audio-visual scene-aware dialog
Audio-visual scene-aware dialog. -
ChatBridge
ChatBridge is a multimodal language model capable of perceiving real-world multimodal information, as well as following instructions, thinking, and interacting with humans in... -
Switchboard Corpus
The Switchboard corpus is a dataset of speech recordings from a switchboard, which is a device that allows multiple people to speak at the same time. -
TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Diffusion models have emerged as a power-ful paradigm for generation, obtaining strong performance in various continuous domains. However, applying continuous diffusion models... -
Language models are few-shot learners
A language model that demonstrates capabilities in processing and generating human-like text. -
Tensor Trust Dataset
A dataset of prompt injection attacks for evaluating the effectiveness of Tensor Trust in detecting prompt injection attacks. -
SPML Dataset
A dataset of system prompts and user prompts for evaluating the effectiveness of SPML in detecting prompt injection attacks. -
Natural Questions
The Natural Questions dataset consists of questions extracted from web queries, with each question accompanied by a corresponding Wikipedia article containing the answer. -
RoentGen: Vision-Language Foundation Model for Chest X-ray Generation
Multimodal models trained on large natural image-text pair datasets have exhibited astounding abilities in gener-ating high-quality images. Medical imaging data is fundamentally...