You're currently viewing an old version of this dataset. To see the current version, click here.

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

Text-based video editing using MaskINT, a two-stage pipeline involving keyframe joint editing and structure-aware frame interpolation.

Data and Resources

Cite this as

Haoyu Ma, Shahin Mahdizadehaghdam, Bichen Wu, Zhipeng Fan, Yuchao Gu, Wenliang Zhao, Lior Shapira, Xiaohui Xie (2024). Dataset: MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers. https://doi.org/10.57702/d60hdpvh

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2312.12468
Author Haoyu Ma
More Authors
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
Homepage https://maskint.github.io