Heartheflow: Optical Flow-Based Self-Supervised Visual Sound Source Localization

Learning to localize the sound source in videos without explicit annotations is a novel area of audio-visual research. Existing work in this area focuses on creating attention maps to capture the correlation between the two modalities to localize the source of the sound.

Data and Resources

Cite this as

Dennis Fedorishin, Deen Dayal Mohan, Bhavin Jawade, Srirangaraj Setlur, Venu Govindaraju (2024). Dataset: Heartheflow: Optical Flow-Based Self-Supervised Visual Sound Source Localization. https://doi.org/10.57702/lt5egsip

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Author Dennis Fedorishin
More Authors
Deen Dayal Mohan
Bhavin Jawade
Srirangaraj Setlur
Venu Govindaraju
Homepage https://github.com/denfed/heartheflow