GPT-4V(ision) as a Generalist Evaluator for Vision-Language Tasks
Automatically evaluating vision-language tasks is challenging, especially when the goal is to reflect human judgments, because automatic evaluation struggles to account for fine-grained details.
GPT-4V Dataset
The GPT-4V dataset used for fine-tuning the Qwen model.