MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

doi:doi:10.57702/d60hdpvh

You're currently viewing an old version of this dataset. To see the current version, click here.

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

Text-based video editing using MaskINT, a two-stage pipeline involving keyframe joint editing and structure-aware frame interpolation.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Haoyu Ma, Shahin Mahdizadehaghdam, Bichen Wu, Zhipeng Fan, Yuchao Gu, Wenliang Zhao, Lior Shapira, Xiaohui Xie (2024). Dataset: MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers. https://doi.org/10.57702/d60hdpvh

DOI retrieved: December 3, 2024

Additional Info

Field	Value
Created	December 3, 2024
Last update	December 3, 2024
Defined In	https://doi.org/10.48550/arXiv.2312.12468
Author	Haoyu Ma
More Authors	Shahin Mahdizadehaghdam Bichen Wu Zhipeng Fan Yuchao Gu Wenliang Zhao Lior Shapira Xiaohui Xie
Homepage	https://maskint.github.io