1 dataset found

Tags: instruction-based evaluation

Filter Results
  • EditEval

    A benchmark for text improvements, focusing on instruction-based evaluation.
You can also access this registry using the API (see API Docs).