1 dataset found

SNLI-VE
The dataset used in the paper is a set of sequential vision-and-language tasks, where each task consists of an image and a text input.
Formats: JSON