Alpaca Eval 2

The dataset used in the paper is Alpaca Eval 2, an automated metric that measures LLMs' alignment with human preferences.