1 dataset found

Formats: JSON Tags: comprehensive evaluation

Filter Results
  • MME

    MME: A comprehensive evaluation benchmark for multimodal large language models
You can also access this registry using the API (see API Docs).