1 dataset found

Formats: JSON
Tags: OpenWebText

  • OpenWebText Corpus

    A dataset for language modeling, where the goal is to predict the next word in a sequence given the previous words.
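
To make the next-word prediction task concrete, the sketch below shows one way raw text could be turned into (context, target) training pairs. It is only an illustration: the function name `next_word_examples`, the `context_size` parameter, and the whitespace tokenization are assumptions made here, not part of the OpenWebText Corpus or of this registry.

```python
def next_word_examples(text, context_size=3):
    """Build (context, target) pairs for next-word prediction.

    Hypothetical helper: splits on whitespace and keeps at most
    `context_size` preceding words as the context for each target.
    """
    words = text.split()
    examples = []
    for i in range(1, len(words)):
        # The context is the window of words immediately before position i.
        context = words[max(0, i - context_size):i]
        examples.append((context, words[i]))
    return examples


if __name__ == "__main__":
    for context, target in next_word_examples("the cat sat on the mat"):
        print(context, "->", target)
```

A language model trained on such pairs learns to assign high probability to each target word given its context. Real pipelines typically use subword tokenizers and much longer contexts than this sketch does.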
You can also access this registry using the API (see API Docs).