You're currently viewing an old version of this dataset. To see the current version, click here.

Google Sheets Dataset

The dataset is constructed from a corpus of Google Sheets publicly shared within our organization. We collected 46K Google Sheets with formulas, and split them into 42K for training, 2.3K for validation, and 1.7K for testing.

Data and Resources

Cite this as

Xinyun Chen, Petros Maniatis, Rishabh Singh, Charles Sutton, Hanjun Dai, Max Lin, Denny Zhou (2024). Dataset: Google Sheets Dataset. https://doi.org/10.57702/h0jtfih6

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Author Xinyun Chen
More Authors
Petros Maniatis
Rishabh Singh
Charles Sutton
Hanjun Dai
Max Lin
Denny Zhou
Homepage https://github.com/google-research/google-research/tree/master/spreadsheet_coder