-
Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Re...
Implicit Discourse Relation Recognition (IDRR), which infers discourse relations without the help of explicit connectives, is still a crucial and challenging task for discourse... -
QQP Dataset
The QQP dataset contains more than 400k question pairs. -
Penn Tree Bank
The Penn Tree Bank dataset is a corpus split into a training set of 929k words, a validation set of 73k words, and a test set of 82k words. The... -
Self-Recognition in Language Models
A self-recognition test for language models using model-generated security questions. -
Confidence Calibration in Large Language Models
The dataset used in this study to analyze the self-assessment behavior of large language models. -
Xl-sum: Large-scale multilingual abstractive summarization
The Xl-sum dataset for multilingual abstractive summarization. -
Cross-Lingual Ability of Multilingual BERT
The Cross-Lingual Ability of Multilingual BERT dataset. -
Multilingual Language Models
The dataset used in this paper for multilingual language models. -
SST-2, SNLI, and PubMed datasets
The dataset used in the paper is a collection of sentence classification tasks, including SST-2, SNLI, and PubMed. -
Corpus Pairs Dataset
Corpus pairs dataset for LABDet, a robust and language-agnostic bias probing method to quantify intrinsic bias in monolingual PLMs. -
Minimal Pairs Dataset
Minimal pairs dataset for LABDet, a robust and language-agnostic bias probing method to quantify intrinsic bias in monolingual PLMs. -
Sentiment Training Dataset
Sentiment training dataset for LABDet, a robust and language-agnostic bias probing method to quantify intrinsic bias in monolingual PLMs. -
GRASP: A Disagreement Analysis Framework to Assess Group Associations in Pers...
Human annotation plays a core role in machine learning — annotations for supervised models, safety guardrails for generative models, and human feedback for reinforcement... -
ChatGPT: A conversational AI model
The dataset used in the paper ChatGPT: A conversational AI model. -
Latent Distance Guided Alignment Training for Large Language Models
Ensuring alignment with human preferences is a crucial characteristic of large language models (LLMs). Presently, the primary alignment methods, RLHF and DPO, require extensive...