Dataset Groups Activity Stream Wikitext-103 and LAMBADA datasets The dataset used in the paper is not explicitly mentioned, but it is mentioned that the authors trained a GPT2 transformer language model on the Wikitext-103 and LAMBADA datasets. BibTex: @dataset{Sheng_Shen_and_Pete_Walsh_and_Kurt_Keutzer_and_Jesse_Dodge_and_Matthew_Peters_and_Iz_Beltagy_2024, abstract = {The dataset used in the paper is not explicitly mentioned, but it is mentioned that the authors trained a GPT2 transformer language model on the Wikitext-103 and LAMBADA datasets.}, author = {Sheng Shen and Pete Walsh and Kurt Keutzer and Jesse Dodge and Matthew Peters and Iz Beltagy}, doi = {10.57702/h8pd5o1u}, institution = {No Organization}, keyword = {'lambada', 'natural language processing', 'wikitext-103'}, month = {dec}, publisher = {TIB}, title = {Wikitext-103 and LAMBADA datasets}, url = {https://service.tib.eu/ldmservice/dataset/wikitext-103-and-lambada-datasets}, year = {2024} }