Language Models - Groups - LDM

Context versus Prior Knowledge in Language Models

The dataset used in the paper to test the persuasion and susceptibility scores of language models.
- Dataset
- JSON

AdvBench dataset

The dataset used for the experiments in the paper, consisting of 60 harmful instructions from the AdvBench dataset.
- Dataset
- JSON

HH-RLHF

The HH-RLHF dataset is a human preference dataset for reinforcement learning from human feedback.
- Dataset
- JSON

Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture

- Dataset
- JSON

Training Language Models to Perform Tasks

A dataset for training language models to perform tasks such as question answering and text classification.
- Dataset
- JSON
