-
Pyramidal Predictive Network: A Model for Visual-frame Prediction Based on Pr...
Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames. -
Implicit Stacked Autoregressive Model for Video Prediction
has been frame Future prediction approached through two primary methods: autoregressive and non-autoregressive. Autoregressive methods rely on the Markov assumption and can... -
Accurate Grid Keypoint Learning for Efficient Video Prediction
Video prediction methods generally consume substantial computing resources in training and deployment, among which keypoint-based approaches show promising improvement in... -
Caltech Pedestrian
The dataset used in the paper is a video prediction dataset with occlusions, which is used to evaluate the proposed Fast Fourier Inception Networks (FFINet) for occluded video... -
MovingMNIST
MovingMNIST is a synthetic dataset for predicting the movement of two digits. -
Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
Video prediction is a challenging task. The quality of video frames from current state-of-the-art (SOTA) generative models tends to be poor and generalization beyond the... -
Decomposing motion and content for natural video sequence prediction
Decomposing motion and content for natural video sequence prediction. -
SDC-Net: Video prediction using spatially-displaced convolution
We present an approach for high-resolution video frame pre-diction by conditioning on both past frames and past optical flows. -
Moving MNIST
Moving MNIST is a benchmark data set for video recognition. There are 10,000 samples including 8,000 for training and 2,000 for test. Each sample consists of 20 sequential gray... -
Sdcnet: Video prediction using spatially-displaced convolution
Spatially-displaced convolution for video prediction.