1 dataset found
Groups: Video Understanding
Organizations: No Organization
Formats: JSON

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-...
TOPA is a text-only pre-alignment framework that extends large language models to video understanding without pre-training on real video data.
Dataset (JSON)