FLIR, MFNet, COME15K, MCXFace

Cross-modal datasets paired with text descriptions, intended for cross-modal image generation under various layout conditions.

Cite this as

Zeyu Wang, Jingyu Lin, Yifei Qian, Yi Huang, Shicen Tian, Bosong Chai, Juncan Deng, Qu Yang, Lan Du, Cunjian Chen, Yufei Guo (2024). Dataset: FLIR, MFNet, COME15K, MCXFace. https://doi.org/10.57702/cm2do1mg

DOI retrieved: December 2, 2024

Additional Info

Field         Value
Created       December 2, 2024
Last update   December 2, 2024
Defined In    https://doi.org/10.48550/arXiv.2407.15488
Author        Zeyu Wang
More Authors  Jingyu Lin, Yifei Qian, Yi Huang, Shicen Tian, Bosong Chai, Juncan Deng, Qu Yang, Lan Du, Cunjian Chen, Yufei Guo
Homepage      https://github.com/zeyuwang-zju/DiffX