- ParSEL: Parameterized Shape Editing with Language
  ParSEL (Parameterized Shape Editing with Language) is a system that enables controllable editing of 3D assets with natural language.
- Temporal Sentence Grounding in Videos
  Temporal sentence grounding in videos (TSGV) is the task of retrieving the video segment that semantically corresponds to a natural-language query.
- GTA: A Benchmark for General Tool Agents
  GTA is a benchmark for General Tool Agents, featuring three main aspects: real user queries, real deployed tools, and real multimodal inputs.
- RateMyProfessor Dataset
  RateMyProfessor dataset, a dataset of student-written reviews of professors.
- Bias in Bios Dataset
  Bias in Bios dataset, a personal biography dataset with information extracted from Wikipedia.
- Language Agency Classification (LAC) Dataset
  Language Agency Classification (LAC) dataset for training accurate language agency classifiers.
- Reference Letter Dataset
  Reference letter dataset generated under the Context-Based Generation (CBG) setting.
- Language Agency Bias Evaluation (LABE)
  Language Agency Bias Evaluation (LABE) framework to systematically and comprehensively measure gender, racial, and intersectional biases in language agency across a wide scope...
- Towards a unified multi-dimensional evaluator for text generation
  The NewsRoom dataset consists of 60 input source texts and 7 output summaries for each sample.
- Of human criteria and automatic metrics: A benchmark of the evaluation of sto...
  The HANNA dataset contains 1056 creative stories generated from 96 prompts collected from WritingPrompt.
- A general theoretical paradigm to understand learning from human preferences
  The paper proposes a novel approach to aligning language models with human preferences, focusing on the use of preference optimization in reward-free RLHF.
- Towards Answering Climate Questionnaires
  Two new large-scale climate questionnaire datasets, CLIMA-CDP and CLIMA-INS, are introduced. The datasets are composed of semi-structured questionnaires from different...
- DALL-E3 and Stable Diffusion Dataset
  A dataset used by the authors to test their hypothesis about the white bear phenomenon in large models.
- White Bear Phenomenon Dataset
  A dataset generated by the authors to test their hypothesis about the white bear phenomenon in large models.