DIFF-FOLEY: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

The Video-to-Audio (V2A) model has recently gained attention for its practical application in generating audio directly from silent videos, particularly in video/film production.

Data and Resources

Cite this as

Simian Luo, Chuanhao Yan, Chenxu Hu, Hang Zhao (2024). Dataset: DIFF-FOLEY: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models. https://doi.org/10.57702/7na6lyzx

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Author Simian Luo
More Authors
Chuanhao Yan
Chenxu Hu
Hang Zhao
Homepage https://diff-foley.github.io/