DOLLY dataset

Diffusion-based language models are emerg-ing as a promising alternative to autoregressive LMs: they approach the competence of autoregressive LMs while offering nuanced controlla-bility at inference time.

Data and Resources

Cite this as

Xiaochuang Han, Yulia Tsvetkov, Sachin Kumar, Marjan Ghazvininejad (2024). Dataset: DOLLY dataset. https://doi.org/10.57702/jg83obu0

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2305.14771
Author Xiaochuang Han
More Authors
Yulia Tsvetkov
Sachin Kumar
Marjan Ghazvininejad
Homepage https://huggingface.co/datasets/databricks/databricks-dolly-15k