-
Modeling Task Interactions in Document-Level Joint Entity and Relation Extrac...
Document-level relation extraction in an end-to-end setting, where the model needs to jointly perform mention extraction, coreference resolution (COREF) and relation extraction...
-
Room-to-Room (R2R) dataset
The Room-to-Room (R2R) dataset is a benchmark for vision-and-language navigation tasks. It consists of 7,189 paths sampled from its navigation graphs, each with three...
-
TruthfulQA
TruthfulQA contains 817 questions designed to evaluate whether language models tend to mimic human falsehoods.
-
Simplifying graph convolutional networks
Simplifying graph convolutional networks.
-
StackLLaMA: An RL fine-tuned LLaMA model for Stack Exchange question and answ...
The dataset used in the paper is the StackExchange dataset.
-
Symbolic, Language Agnostic and Ontologically Grounded Large Language Models
The dataset is used in the paper to demonstrate the limitations of large language models (LLMs) in capturing inferential aspects of natural language.
-
SimpleQuestion
SimpleQuestion is a question answering dataset consisting of 100,000 questions and 1,000,000 answers.
-
REVERIE dataset
The REVERIE dataset is a dataset of household tasks in an indoor environment. It contains images annotated with natural language instructions including the referring expressions...
-
PandaLM Dataset
The dataset used to train PandaLM, a generative safety evaluator for Chinese.
-
Auto-J Dataset
The dataset used to train Auto-J, a generative safety evaluator for English.
-
Jade Dataset
The dataset used to train Jade, a linguistic-based safety evaluation platform for Chinese.
-
ShieldLM Dataset
The dataset used to train ShieldLM, a generative safety evaluator for English.
-
SAFETY-J Dataset
The dataset used to train SAFETY-J, a bilingual generative safety evaluator for English and Chinese.