1 dataset found

Tags: comprehensive evaluation

Filter Results
  • MME

    MME: A comprehensive evaluation benchmark for multimodal large language models
You can also access this registry using the API (see API Docs).