2 datasets found

Formats: JSON Tags: Dataset Creation

Filter Results
  • RedPajama

    The RedPajama dataset is an open-source recipe to reproduce the LLaMA training dataset.
  • LAION

    The dataset used in the paper is not explicitly described, but it is mentioned that it is a large-scale captioned image dataset (LAION) used to train the Stable Diffusion model.
You can also access this registry using the API (see API Docs).