-
Estimating visual information from audio through manifold learning
Estimating visual information from audio through manifold learning. -
Visual to sound: Generating natural sound for videos in the wild
Visual to sound: Generating natural sound for videos in the wild. -
Deep cross-modal audio-visual generation
Deep cross-modal audio-visual generation. -
Sound2Scene
Sound2Scene is a sound-to-image generative model and training procedure that addresses the challenges of dealing with the large gaps that often exist between sight and sound.