1 dataset found

Tags: M4

  • M4

    The M4 dataset consists of human-written texts from several data sources; in its English subset, these include Wikipedia, Reddit, and arXiv. It pairs the human-written...
You can also access this registry using the API (see API Docs).