UniDiff

UniDiff is a unified vision-language model that integrates discriminative and generative capabilities in vision-language tasks.

Data and Resources

Cite this as

Xiao Dong, Runhui Huang, Xiaoyong Wei, Zequn Jie, Jianxing Yu, Jian Yin, Xiaodan Liang (2024). Dataset: UniDiff. https://doi.org/10.57702/5mbf8orw

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2306.00813
Author Xiao Dong
More Authors
Runhui Huang
Xiaoyong Wei
Zequn Jie
Jianxing Yu
Jian Yin
Xiaodan Liang
Homepage https://arxiv.org/abs/2209.15264