-
Mark Twain Books
A dataset of Mark Twain's books, used for testing the author-stylized text generation model. -
Opinosis Review Dataset
A dataset of Opinosis Review dataset, used for testing the author-stylized text generation model. -
Gutenberg Corpus
A dataset of 2,857 books written by 141 authors, used for pre-training and fine-tuning a language model for author-stylized text generation.