Heartheflow: Optical Flow-Based Self-Supervised Visual Sound Source Localization

doi:doi:10.57702/lt5egsip

Heartheflow: Optical Flow-Based Self-Supervised Visual Sound Source Localization

Learning to localize the sound source in videos without explicit annotations is a novel area of audio-visual research. Existing work in this area focuses on creating attention maps to capture the correlation between the two modalities to localize the source of the sound.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Dennis Fedorishin, Deen Dayal Mohan, Bhavin Jawade, Srirangaraj Setlur, Venu Govindaraju (2024). Dataset: Heartheflow: Optical Flow-Based Self-Supervised Visual Sound Source Localization. https://doi.org/10.57702/lt5egsip

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Author	Dennis Fedorishin
More Authors	Deen Dayal Mohan Bhavin Jawade Srirangaraj Setlur Venu Govindaraju
Homepage	https://github.com/denfed/heartheflow