-
TruthfulQA
The TruthfulQA dataset is a dataset that contains 817 questions designed to evaluate language models' preference to mimic some human falsehoods. -
WIMCOR: A Large Harvested Corpus of Location Metonymy
WIMCOR is a large and rich dataset of location metonymy, extracted using Wikipedia. It is suitable for metonymy detection and entity linking tasks. -
Extracting Blockchain Concepts from Text
The dataset is used to extract information from whitepapers and academic articles focused on the blockchain area to organize this information and aid users to navigate the space. -
ACL Anthology Dataset
The ACL Anthology dataset contains 21,212 papers, 17,792 authors, 342 venues, and 110,975 citations. -
ACL Anthology
The ACL Anthology dataset contains papers on natural language processing, including citation patterns, authorship, and language use over time. -
Collecting and Characterizing Natural Language Utterances for Specifying Data...
A dataset of natural language utterances for specifying data visualizations. -
Semantic Profiling of Natural Language Utterances for Data Visualization Gene...
A dataset of 500 natural language utterances for data visualization generation, including utterances with uncertainties and missing data references. -
NL4Opt Generation Dataset
The NL4Opt Generation Dataset consists of 1101 examples, divided into the train, dev, and test splits composed of 713, 99, and 289 examples, respectively. Each example consists... -
Zh-En Multi-Domain Dataset
The Zh-En multi-domain dataset consists of four balanced domains: news, patent, subtitles, and COVID-19. -
XSUM Dataset
The XSUM dataset comprises 226,711 British Broadcasting Corporation (BBC) articles paired with their single-sentence summaries. -
Detecting Hallucinated Content in Conditional Neural Sequence Generation
Neural sequence models can generate highly fluent sentences, but recent studies have also shown that they are also prone to hallucinate additional content not supported by the... -
Machine Translation and Automated Analysis of the Sumerian Language Dataset
The Machine Translation and Automated Analysis of the Sumerian Language dataset, which contains Sumerian texts in cuneiform script. -
Sumerian Cuneiform Dataset
The dataset used for the study of Sumerian cuneiform, including part-of-speech tagging, named entity recognition, and machine translation. -
Intrinsic Dimensions of Language Fractal Structures
The dataset consists of embeddings of all n-grams of a natural language, constituting a representative sample of a language fractal structure. -
SNLI dataset
The dataset used in the paper is the SNLI dataset. -
Linear-time minimum Bayes risk decoding with reference aggregation
Linear-time minimum Bayes risk decoding with reference aggregation -
Finetuned language models are zero-shot learners
Finetuned language models are zero-shot learners -
Evaluating large language models trained on code
The paper presents the results of the OpenAI Codex evaluation on generating Python code. -
Improving Minimum Bayes Risk Decoding with Multi-Prompt
Multi-prompt decoding for conditional text generation -
ChatGPT and GPT-4
A dataset for evaluating the logical reasoning ability of chatgpt and gpt-4.