RAMM: Retrieval-augmented Biomedical Visual Question Answering

doi:doi:10.57702/fxt8j2bb

RAMM: Retrieval-augmented Biomedical Visual Question Answering

A retrieval-augmented pretrain-and-ﬁnetune paradigm for biomedical VQA which includes a high-quality image-text pairs PMCPM, a pre-trained multi-modal model, and a novel retrieval-augmented attention module for ﬁne-tuning.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Zheng Yuan, Qiao Jin, Chuanqi Tan, Zhengyun Zhao, Hongyi Yuan, Fei Huang, Songfang Huang (2024). Dataset: RAMM: Retrieval-augmented Biomedical Visual Question Answering. https://doi.org/10.57702/fxt8j2bb

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.2303.00534
Author	Zheng Yuan
More Authors	Qiao Jin Chuanqi Tan Zhengyun Zhao Hongyi Yuan Fei Huang Songfang Huang