BIG-Bench Hard

doi:doi:10.57702/jczynul3

BIG-Bench Hard

The BIG-Bench Hard dataset is derived from the original BIG-Bench evaluation suite, focusing on tasks that pose challenges to existing language models.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Mirac Suzgun, Nathan Scales, Nathanael Scharli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed Huai hsin Chi, Denny Zhou, Jason Wei (2024). Dataset: BIG-Bench Hard. https://doi.org/10.57702/jczynul3

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.2404.10500
Citation	https://doi.org/10.48550/arXiv.2307.10573
Author	Mirac Suzgun
More Authors	Nathan Scales Nathanael Scharli Sebastian Gehrmann Yi Tay Hyung Won Chung Aakanksha Chowdhery Quoc V. Le Ed Huai hsin Chi Denny Zhou Jason Wei
Homepage	https://big-bench.readthedocs.io/en/latest/