Multimodal Visual Patterns (MMVP) Benchmark

The Multimodal Visual Patterns (MMVP) benchmark is a dataset used to evaluate the visual question answering capabilities of multimodal large language models (MLLMs).

Cite this as

Shengbang Tong, Yi Ma, Zhuang Liu, Yann LeCun, Yuexiang Zhai, Saining Xie (2024). Dataset: Multimodal Visual Patterns (MMVP) Benchmark. https://doi.org/10.57702/7mfkee4u

DOI retrieved: December 16, 2024

Additional Info

Field          Value
Created        December 16, 2024
Last update    December 16, 2024
Defined In     https://doi.org/10.48550/arXiv.2401.06209
Author         Shengbang Tong
More Authors   Yi Ma, Zhuang Liu, Yann LeCun, Yuexiang Zhai, Saining Xie