DIFF-FOLEY: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

doi:10.57702/7na6lyzx

Video-to-Audio (V2A) models have recently gained attention for their practical use in generating audio directly from silent videos, particularly in video and film production.

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.

Cite this as

Simian Luo, Chuanhao Yan, Chenxu Hu, Hang Zhao (2024). Dataset: DIFF-FOLEY: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models. https://doi.org/10.57702/7na6lyzx

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Author	Simian Luo
More Authors	Chuanhao Yan, Chenxu Hu, Hang Zhao
Homepage	https://diff-foley.github.io/