Dataset - LDM

Dense regression network for video grounding

Dense regression network for video grounding
- Dataset
- JSON
Semantic conditioned dynamic modulation for temporal sentence grounding in vi...

Semantic conditioned dynamic modulation for temporal sentence grounding in videos
- Dataset
- JSON
Multilevel language and vision integration for text-to-clip retrieval

Multilevel language and vision integration for text-to-clip retrieval
- Dataset
- JSON
Tall: Temporal activity localization via language query

Tall: Temporal activity localization via language query.
- Dataset
- JSON
Support-Set Based Cross-Supervision for Video Grounding

Support-Set Based Cross-Supervision for Video Grounding
- Dataset
- JSON
Localizing moments in video with natural language

Localizing moments in video with natural language
- Dataset
- JSON
Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Vide...

Human-centric spatio-temporal video grounding (HC-STVG) task aims to localize a spatio-temporal tube of the target person indicated by a language description.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

7 datasets found