Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering

doi:10.57702/fawlhdcu

Knowledge-based visual question answering (KVQA) has been studied extensively as the task of answering visual questions with the help of external knowledge, e.g., knowledge graphs (KGs).

Data and Resources

Original Metadata (JSON)
The json representation of the dataset with its distributions based on DCAT.

Cite this as

Junnan Dong, Qinggang Zhang, Huachi Zhou, Daochen Zha, Pai Zheng, Xiao Huang (2024). Dataset: Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering. https://doi.org/10.57702/fawlhdcu

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.2402.12728
Author	Junnan Dong
More Authors	Qinggang Zhang, Huachi Zhou, Daochen Zha, Pai Zheng, Xiao Huang