-
LLM Ethics Dataset
The dataset used in this study to explore the ethical issues surrounding Large Language Models (LLMs). -
Alpaca Eval 2
The dataset used in the paper is Alpaca Eval 2, which is an automated metric that measures LLMs' alignment with human preferences. -
CodeUltraFeedback
CodeUltraFeedback is a preference dataset of 10,000 complex instructions to tune and align LLMs to coding preferences through AI feedback. -
M2Chat: Empowering VLM for Multimodal LLM Interleaved
M2Chat is a novel unified multimodal LLM framework for generating interleaved text-image conversation across various scenarios. -
ChatGPT Dataset
The dataset used in this study consists of a large language model (LLM) enabled platform - ChatGPT.