LAMBADA

doi:doi:10.57702/vh6wq2i6

LAMBADA

The dataset used in the paper is a corpus of text containing approximately 10,000 examples, each a sequence of sentences extracted from books.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Denis Paperno, Germán Kruszewski, Angeliki Lazaridou, Quan Ngoc Pham, Raffaella Bernardi, Sandro Pezzelle, Marco Baroni, Gemma Boleda, Raquel Fernández (2024). Dataset: LAMBADA. https://doi.org/10.57702/vh6wq2i6

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.1803.08983
Citation	https://doi.org/10.48550/arXiv.2401.12819 https://doi.org/10.48550/arXiv.2305.12356
Author	Denis Paperno
More Authors	Germán Kruszewski Angeliki Lazaridou Quan Ngoc Pham Raffaella Bernardi Sandro Pezzelle Marco Baroni Gemma Boleda Raquel Fernández
Homepage	https://huggingface.co/datasets/LAMBADA