-
DIFF-FOLEY: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
The Video-to-Audio (V2A) model has recently gained attention for its practical application in generating audio directly from silent videos, particularly in video/film production. -
THE MIRRORNET: LEARNING AUDIO SYNTHESIZER CONTROLS INSPIRED BY SENSORIMOTOR I...
The MirrorNet model is applied to learn, in an unsupervised manner, the controls of a specific audio synthesizer (DIVA) to produce melodies only from their auditory spectrograms.