ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

A music generation model that leverages free-form text as a conditioning factor, utilizing the diffusion model to generate waveform-based music.

BibTex: