-
Caltech Pedestrian
The dataset used in the paper is a video prediction dataset with occlusions, which is used to evaluate the proposed Fast Fourier Inception Networks (FFINet) for occluded video... -
MovingMNIST
MovingMNIST is a synthetic dataset for predicting the movement of two digits. -
Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
Video prediction is a challenging task. The quality of video frames from current state-of-the-art (SOTA) generative models tends to be poor and generalization beyond the... -
Decomposing motion and content for natural video sequence prediction
Decomposing motion and content for natural video sequence prediction. -
SDC-Net: Video prediction using spatially-displaced convolution
We present an approach for high-resolution video frame pre-diction by conditioning on both past frames and past optical flows. -
Moving MNIST
Moving MNIST is a benchmark data set for video recognition. There are 10,000 samples including 8,000 for training and 2,000 for test. Each sample consists of 20 sequential gray... -
Sdcnet: Video prediction using spatially-displaced convolution
Spatially-displaced convolution for video prediction.