-
Enron Corpus
The Enron corpus is a dataset of over 17K Excel Spreadsheets extracted from the Enron email corpus. -
Google Sheets Dataset
The dataset is constructed from a corpus of Google Sheets publicly shared within our organization. We collected 46K Google Sheets with formulas, and split them into 42K for... -
SPREADSHEETCODER: Formula Prediction from Semi-structured Context
Spreadsheet formula prediction has been an im-portant program synthesis problem with many real-world applications. Previous works typically utilize input-output examples as the...