Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation

Video prediction is a challenging task. Current state-of-the-art (SOTA) generative models tend to produce low-quality video frames, and they generalize poorly beyond the training data.

Cite this as

Vikram Voleti, Alexia Jolicoeur-Martineau, Christopher Pal (2024). Dataset: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation. https://doi.org/10.57702/g64324dz

DOI retrieved: December 3, 2024

Additional Info

Field          Value
Created        December 3, 2024
Last update    December 3, 2024
Defined In     https://doi.org/10.48550/arXiv.2205.09853
Author         Vikram Voleti
More Authors   Alexia Jolicoeur-Martineau, Christopher Pal
Homepage       https://mask-cond-video-diffusion.github.io