EI2 model for text-driven video editing

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used the DAVIS dataset and the Pexels website to gather face videos.

BibTex: