1 dataset found

Groups: Language Model Evaluation

Filter Results
  • Qiyas Benchmark

    The Qiyas benchmark is a standardized General Aptitude Test (GAT) used for university admissions in Saudi Arabia, ensuring its quality and relevance to real-world assessment. It...