-
Detecting Hallucinated Content in Conditional Neural Sequence Generation
Neural sequence models can generate highly fluent sentences, but recent studies have also shown that they are also prone to hallucinate additional content not supported by the... -
Machine Translation and Automated Analysis of the Sumerian Language Dataset
The Machine Translation and Automated Analysis of the Sumerian Language dataset, which contains Sumerian texts in cuneiform script. -
Sumerian Cuneiform Dataset
The dataset used for the study of Sumerian cuneiform, including part-of-speech tagging, named entity recognition, and machine translation. -
Intrinsic Dimensions of Language Fractal Structures
The dataset consists of embeddings of all n-grams of a natural language, constituting a representative sample of a language fractal structure. -
SNLI dataset
The dataset used in the paper is the SNLI dataset. -
Linear-time minimum Bayes risk decoding with reference aggregation
Linear-time minimum Bayes risk decoding with reference aggregation -
Finetuned language models are zero-shot learners
Finetuned language models are zero-shot learners -
Evaluating large language models trained on code
The paper presents the results of the OpenAI Codex evaluation on generating Python code. -
Improving Minimum Bayes Risk Decoding with Multi-Prompt
Multi-prompt decoding for conditional text generation -
ChatGPT and GPT-4
A dataset for evaluating the logical reasoning ability of chatgpt and gpt-4. -
A Joint Model for Definition Extraction with Syntactic Connection and Semantic...
Definition Extraction (DE) is one of the well-known topics in Information Extraction that aims to identify terms and their corresponding definitions in unstructured texts. -
Chimera dataset
The Chimera dataset is a ‘Chimera’ dataset of (Lazaridou et al., 2017). This dataset was specifically constructed to sim- ulate a nonce situation where a speaker encoun- ters a... -
TaxiXNLI (translated)
Multilingual extension of the TAXINLI dataset for analyzing the effects of reasoning types on cross-lingual transfer performance. -
TaxiXNLI (diagnostic)
Multilingual extension of the TAXINLI dataset for analyzing the effects of reasoning types on cross-lingual transfer performance. -
Corpus of Linguistic Acceptability (CoLA)
The Corpus of Linguistic Acceptability (CoLA) is a set of 10,657 English sentences labeled as grammatical or ungrammatical from published linguistics literature. -
Execution-based Evaluation for NL2Bash
A set of 50 prompts to evaluate execution-based evaluation for NL2Bash task -
Words2Contact
The Words2Contact dataset contains verbal instructions for humanoid robots to place support contacts. -
Word2Vec: A Novel Semi-Supervised Learning Approach for Word Embeddings
Word2Vec is a technique for learning vector representations of words in a text corpus. -
SimVerb-3500: A Large-Scale Evaluation Set of Verb Similarity
SimVerb-3500 is a large-scale evaluation set of verb similarity, providing human ratings for the similarity of 3,500 verb pairs.