- NYU CTF Dataset
  A scalable open-source benchmark dataset for evaluating LLMs in offensive security.
- An Empirical Evaluation of LLMs for Solving Offensive Security Challenges
  An empirical evaluation of LLMs for solving offensive security challenges.
- Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping
  Self-alignment is an effective way to reduce the cost of human annotation while ensuring promising model capability. This objective can be achieved from three aspects: (i) high...
- ConstraintChecker
  ConstraintChecker is a plugin component for LLMs that handles explicit relational constraints in commonsense knowledge base (CSKB) reasoning.
- Monitoring CIFs During Disasters Using LLMs
  The dataset used in this paper for monitoring Critical Infrastructure Facilities (CIFs) during disasters using Large Language Models (LLMs).
- LLMs for Social Robotics
  The dataset is not explicitly described; the authors recreated three existing human-robot interaction (HRI) studies with LLMs.
- Mixtral of Experts
  The dataset used in the paper for the instruction-following task.
- Llama 2-7B-80k
  The dataset used in the paper for the instruction-following task.
- Mistral 7B
  The dataset used in the paper for the instruction-following task.
- AlpacaEval 2.0
  The dataset used in the paper for the instruction-following task.
- Evol-Instruct-70k
  The dataset used in the paper for the in-context learning task.
- MoralChoice
  The MoralChoice survey dataset contains 1767 moral decision-making scenarios. Every scenario consists of a triple (context, action 1, action 2) and a set of auxiliary labels (see the record sketch after this list).
- Forbidden Question Dataset
  A dataset used to evaluate the effectiveness of different jailbreak attack methods against LLMs; it contains 160 highly diverse forbidden questions.
- Jailbreak Attack Dataset
  The dataset used in the paper to evaluate the effectiveness of different jailbreak attack methods against Large Language Models (LLMs).
- Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and...
  A dataset of 29 multi-party conversations between patients, their companions, and a social robot in a hospital.
- EM-Assist: Safe Automated ExtractMethod Refactoring with LLMs
  EM-Assist is an automated refactoring tool that uses LLMs to generate refactoring suggestions, then validates, enhances, and ranks them.
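
The MoralChoice entry above describes each record as a (context, action 1, action 2) triple plus auxiliary labels. The minimal Python sketch below shows one way such a record could be represented in code; the class and field names (MoralScenario, action_1, labels) and the example content are illustrative assumptions, not the dataset's actual schema.

```python
from dataclasses import dataclass, field
from typing import Dict

# Hypothetical record shape for one MoralChoice scenario:
# a (context, action 1, action 2) triple plus auxiliary labels.
# Field names are assumptions for illustration only.
@dataclass
class MoralScenario:
    context: str                                  # situation description
    action_1: str                                 # first candidate action
    action_2: str                                 # second candidate action
    labels: Dict[str, str] = field(default_factory=dict)  # auxiliary labels

# Invented example instance, not taken from the dataset.
scenario = MoralScenario(
    context="You find a lost wallet containing cash and an ID.",
    action_1="Return the wallet to its owner.",
    action_2="Keep the cash and discard the wallet.",
    labels={"ambiguity": "low"},
)
print(scenario.context, scenario.labels)
```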