-
sentencesBooks dataset
A collection of sentences from literature books (sentencesBooks), containing 56,557 labels and 2,400 total words. -
sentencesInternet dataset
A collection of sentences collected from the Internet (sentencesInternet), containing 85,941 labels and 4,800 total words. -
Literature de jeunesse libre (LjL) dataset
Literature de jeunesse libre (LjL) dataset, containing 334,026 labels and 2,060 total words. -
Experiments with GPT-3-based difficulty estimation
The dataset used for the experiments, containing three datasets: Literature de jeunesse libre (LjL), a collection of sentences collected from the Internet (sentencesInternet),...