Video Prediction - Groups

Pyramidal Predictive Network: A Model for Visual-frame Prediction Based on Pr...

Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames.

Dataset
JSON

Human 3.6

The Human 3.6 dataset contains motion capture data of a person, recorded with a high-speed 3D camera.

Dataset
JSON

TrafficBJ

The TrafficBJ dataset is a collection of taxicab GPS data and meteorological data recorded in Beijing.

Dataset
JSON

Implicit Stacked Autoregressive Model for Video Prediction

Future frame prediction has been approached through two primary methods: autoregressive and non-autoregressive. Autoregressive methods rely on the Markov assumption and can...

Dataset
JSON

Accurate Grid Keypoint Learning for Efficient Video Prediction

Video prediction methods generally consume substantial computing resources in training and deployment, among which keypoint-based approaches show promising improvement in...

Dataset
JSON

Caltech Pedestrian

The dataset used in the paper is a video prediction dataset with occlusions, which is used to evaluate the proposed Fast Fourier Inception Networks (FFINet) for occluded video...

Dataset
JSON

TaxiBJ

The dataset used in the paper is a video prediction dataset with occlusions, which is used to evaluate the proposed Fast Fourier Inception Networks (FFINet) for occluded video...

Dataset
JSON

MovingMNIST

MovingMNIST is a synthetic dataset for predicting the movement of two digits.

Dataset
JSON

Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation

Video prediction is a challenging task. The quality of video frames from current state-of-the-art (SOTA) generative models tends to be poor and generalization beyond the...

Dataset
JSON

UVG

The UVG dataset comprises 16 video sequences of 3840×2160, recorded at 120 frames per second.

Dataset
JSON

Decomposing motion and content for natural video sequence prediction

Decomposing motion and content for natural video sequence prediction.

Dataset
JSON

SDC-Net: Video prediction using spatially-displaced convolution

We present an approach for high-resolution video frame prediction by conditioning on both past frames and past optical flows.

Dataset
JSON

Moving MNIST

Moving MNIST is a benchmark dataset for video recognition. There are 10,000 samples, including 8,000 for training and 2,000 for testing. Each sample consists of 20 sequential gray...

Dataset
JSON

Sdcnet: Video prediction using spatially-displaced convolution

Spatially-displaced convolution for video prediction.

Dataset
JSON

14 datasets found