AstroMLab 1: Who Wins Astronomy Jeopardy!?

A comprehensive evaluation of proprietary and open-weights large language models using the first astronomy-specific benchmarking dataset.

Data and Resources

Cite this as

Yuan-Sen Ting, Tuan Dung Nguyen, Tirthankar Ghosal, Rui Pan, Hardik Arora, Zechang Sun, Tijmen de Haan, Nesar Ramachandra, Azton Wells, Sandeep Madireddy, Alberto Accomazzi (2024). Dataset: AstroMLab 1: Who Wins Astronomy Jeopardy!?. https://doi.org/10.57702/0pvkq94j

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2407.11194
Author Yuan-Sen Ting
More Authors
Tuan Dung Nguyen
Tirthankar Ghosal
Rui Pan
Hardik Arora
Zechang Sun
Tijmen de Haan
Nesar Ramachandra
Azton Wells
Sandeep Madireddy
Alberto Accomazzi