Language Models with Image Descriptors

The Language Models with Image Descriptors dataset is used to evaluate the performance of the InstructVid2Vid model.

BibTeX: