RedPajama

The RedPajama dataset is an open-source recipe to reproduce the LLaMA training dataset.

BibTex: