MMX-Trailer-20 Dataset

Long form video understanding (LVU) is a sub-domain of video recognition concerned with understanding contextual information across contiguous shots which can contain multiple locations, scenes, interactions, and actions.

Data and Resources

Cite this as

Edward Fish, Jon Weinbren, Andrew Gilbert (2024). Dataset: MMX-Trailer-20 Dataset. https://doi.org/10.57702/aevzvt9w

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Author Edward Fish
More Authors
Jon Weinbren
Andrew Gilbert
Homepage https://arxiv.org/abs/2106.02036