Memory Colors Dataset

Two benchmark sets for measuring the extent to which language-and-vision language models use the visual signal in the presence or absence of stereotypes.

BibTex: