Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering

Knowledge-based visual question answering (KVQA) has been extensively studied to answer visual questions with external knowledge, e.g., knowledge graphs (KGs).

Data and Resources

Cite this as

Junnan Dong, Qinggang Zhang, Huachi Zhou, Daochen Zha, Pai Zheng, Xiao Huang (2024). Dataset: Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering. https://doi.org/10.57702/fawlhdcu

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2402.12728
Author Junnan Dong
More Authors
Qinggang Zhang
Huachi Zhou
Daochen Zha
Pai Zheng
Xiao Huang