- TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Und...
TimeChat is a time-sensitive multimodal large language model specifically designed for long video understanding. It incorporates two key architectural contributions: a...
- VideoStreaming
A novel approach to tackle the complexities of long video understanding with large language models (LLMs). Our proposed memory-propagated streaming encoding architecture...