1 dataset found

Groups: Question Answering Organizations: No Organization

Filter Results
  • PathVQA

    The dataset used in the paper is a set of sequential vision-and-language tasks, where each task consists of an image and a text input.