Natural Language Processing - Groups

ParSEL: Parameterized Shape Editing with Language

ParSEL: Parameterized Shape Editing with Language, a system that enables controllable editing of 3D assets with natural language.

Dataset
JSON

Temporal Sentence Grounding in Videos

Temporal sentence grounding in videos (TSGV) is a task to retrieve a video segment that semantically corresponds to a query in natural language.

Dataset
JSON

APIBank

APIBank is a comprehensive benchmark for tool-augmented LLMs, focusing on API calling, retrieving, and planning abilities.

Dataset
JSON

APIBench

APIBench is a comprehensive benchmark for tool-augmented LLMs, focusing on API calling, retrieving, and planning abilities.

Dataset
JSON

GTA: A Benchmark for General Tool Agents

GTA is a benchmark for General Tool Agents, featuring three main aspects: real user queries, real deployed tools, and real multimodal inputs.

Dataset
JSON

LLMBI

The Large Language Model Bias Index (LLMBI) is a pio-neering approach designed to quantify and address biases inherent in large language models (LLMs), such as GPT-4.

Dataset
JSON

RateMyProfessor Dataset

RateMyProfessor dataset, a dataset of student-written reviews for professors.

Dataset
JSON

Bias in Bios Dataset

Bias in Bios dataset, a personal biography dataset with information extracted from Wikipedia.

Dataset
JSON

Language Agency Classification (LAC) Dataset

Language Agency Classification (LAC) dataset for training accurate language agency classifiers.

Dataset
JSON

Reference Letter Dataset

Reference letter dataset generated under the Context-Based Generation (CBG) setting.

Dataset
JSON

Language Agency Bias Evaluation (LABE)

Language Agency Bias Evaluation (LABE) framework to systematically and comprehensively measure gender, racial, and intersectional biases in language agency across a wide scope...

Dataset
JSON

S2ORC

A collection of 81.1 million scholarly publications in English from various academic fields, used to pre-train a language model.

Dataset
JSON

Towards a unified multi-dimensional evaluator for text generation

The NewsRoom dataset consists of 60 input source texts and 7 output summaries for each sample.

Dataset
JSON

Of human criteria and automatic metrics: A benchmark of the evaluation of sto...

The HANNA dataset contains 1056 creative story writings generated from 96 prompts collected from WritingPrompt.

Dataset
JSON

A general theoretical paradigm to understand learning from human preferences

The paper proposes a novel approach to aligning language models with human preferences, focusing on the use of preference optimization in reward-free RLHF.

Dataset
JSON

RRSIS

Referring Remote Sensing Image Segmentation (RRSIS) is a new challenge that combines computer vision and natural language processing, delineating specific regions in aerial...

Dataset
JSON

RRSIS-D

Referring Remote Sensing Image Segmentation (RRSIS) is a new challenge that combines computer vision and natural language processing, delineating specific regions in aerial...

Dataset
JSON

Towards Answering Climate Questionnaires

Two new large-scale climate questionnaire datasets, CLIMA-CDP and CLIMA-INS, are introduced. The datasets are composed of semi-structured questionnaires from different...

Dataset
JSON

DALL-E3 and Stable Diffusion Dataset

A dataset used by the authors to test their hypothesis about the white bear phenomenon in large models.

Dataset
JSON

White Bear Phenomenon Dataset

A dataset generated by the authors to test their hypothesis about the white bear phenomenon in large models.

Dataset
JSON

530 datasets found