Griffon v2

Griffon v2 is a high-resolution multimodal model that supports input image resolutions up to 1K and visual-language co-referring, in which target objects can be referred to with visual as well as textual prompts.

Cite this as

Yufei Zhan, Yousong Zhu, Hongyin Zhao, Fan Yang, Ming Tang, Jinqiao Wang (2025). Dataset: Griffon v2. https://doi.org/10.57702/rep59jrt

DOI retrieved: January 3, 2025

Additional Info

Field          Value
Created        January 3, 2025
Last update    January 3, 2025
Defined In     https://doi.org/10.48550/arXiv.2403.09333
Author         Yufei Zhan
More Authors   Yousong Zhu, Hongyin Zhao, Fan Yang, Ming Tang, Jinqiao Wang
Homepage       https://github.com/jefferyZhan/Griffon