Linear-time minimum Bayes risk decoding with reference aggregation
Linear-time minimum Bayes risk decoding with reference aggregation -
Improving Minimum Bayes Risk Decoding with Multi-Prompt
Multi-prompt decoding for conditional text generation -
RateMyProfessor Dataset
RateMyProfessor dataset, a dataset of student-written reviews for professors. -
Bias in Bios Dataset
Bias in Bios dataset, a personal biography dataset with information extracted from Wikipedia. -
Reference Letter Dataset
Reference letter dataset generated under the Context-Based Generation (CBG) setting. -
Wikipedia Corpus
The dataset used in the paper is a subset of the Wikipedia corpus, consisting of 7500 English Wikipedia articles belonging to one of the following categories: People, Cities,... -
ChatGPT model data
ChatGPT model data, used to generate text -
Adding A Filter Based on The Discriminator to Improve Unconditional Text Gene...
The dataset is used for unconditional text generation, and the authors propose a novel mechanism to improve the generator by adding a filter which has the same input as the... -
Wikitext-2
The paper does not describe its dataset explicitly, but it mentions using the Wikitext-2 dataset for text generation tasks. -
TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Diffusion models have emerged as a powerful paradigm for generation, obtaining strong performance in various continuous domains. However, applying continuous diffusion models... -
Language models are few-shot learners
A language model that demonstrates capabilities in processing and generating human-like text. -
BookCorpus
The dataset used in this paper for unsupervised sentence representation learning, consisting of paragraphs from unlabeled text.