1 dataset found

Tags: Language Model Evaluation

Filter Results
  • Qiyas Benchmark

    The Qiyas benchmark is a standardized General Aptitude Test (GAT) used for university admissions in Saudi Arabia, ensuring its quality and relevance to real-world assessment. It...
You can also access this registry using the API (see API Docs).