Dataset - LDM

NLDD

The NLDD dataset is a specialized open-source collection designed for the Natural Language to Software Generation domain.
- Dataset
- JSON
LLM Ethics Dataset

The dataset used in this study explores the ethical issues surrounding Large Language Models (LLMs).
- Dataset
- JSON
MT-bench

The dataset used in the paper is MT-bench, which is an LLM-based automated evaluation metric comprising 80 challenging questions.
- Dataset
- JSON
Alpaca Eval 2

The dataset used in the paper is Alpaca Eval 2, which is an automated metric that measures LLMs' alignment with human preferences.
- Dataset
- JSON
CodeUltraFeedback

CodeUltraFeedback is a preference dataset of 10,000 complex instructions to tune and align LLMs to coding preferences through AI feedback.
- Dataset
- JSON
M2Chat: Empowering VLM for Multimodal LLM Interleaved

M2Chat is a novel unified multimodal LLM framework for generating interleaved text-image conversation across various scenarios.
- Dataset
- JSON
ChatGPT Dataset

The dataset used in this study consists of a large language model (LLM)-enabled platform, ChatGPT.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

7 datasets found
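
The API access mentioned above could be scripted roughly as follows. This is a minimal sketch, not the registry's documented interface: it assumes the registry exposes a CKAN-style Action API with a `package_search` endpoint, and the base URL `https://registry.example.org/` is a placeholder. The helper names `build_search_url` and `extract_titles` are hypothetical. Check the registry's API Docs for the real endpoint and response shape.

```python
import json
from urllib.parse import urlencode, urljoin

# Placeholder address (assumption); replace with the registry's real base URL.
BASE_URL = "https://registry.example.org/"


def build_search_url(base_url, query, rows=10):
    """Build a search URL, assuming a CKAN-style package_search endpoint."""
    params = urlencode({"q": query, "rows": rows})
    return urljoin(base_url, "api/3/action/package_search") + "?" + params


def extract_titles(payload):
    """Return (total_count, titles) from a CKAN-style JSON response string."""
    body = json.loads(payload)
    if not body.get("success"):
        raise ValueError("API call reported failure")
    result = body["result"]
    return result["count"], [pkg["title"] for pkg in result["results"]]
```

To query a live registry, the URL from `build_search_url(BASE_URL, "LLM")` can be fetched with `urllib.request.urlopen` and the decoded response body passed to `extract_titles`. Keeping URL construction and parsing separate from the network call makes both pieces easy to check offline.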