BIG-Bench Hard

BIG-Bench Hard (BBH) is a subset of the original BIG-Bench evaluation suite. It collects 23 tasks on which earlier language models failed to outperform the average human rater.

Cite this as

Mirac Suzgun, Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed Huai-hsin Chi, Denny Zhou, Jason Wei (2024). Dataset: BIG-Bench Hard. https://doi.org/10.57702/jczynul3

DOI retrieved: December 16, 2024

Additional Info

Created: December 16, 2024
Last update: December 16, 2024
Defined In: https://doi.org/10.48550/arXiv.2404.10500
Citation: https://doi.org/10.48550/arXiv.2307.10573
Author: Mirac Suzgun
More Authors: Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed Huai-hsin Chi, Denny Zhou, Jason Wei
Homepage: https://big-bench.readthedocs.io/en/latest/