3 datasets found

Groups: Multimedia

Filter Results
  • YFCC-100M

    The dataset used in the paper is the YFCC-100M dataset, a large-scale image dataset.
  • Visual Text Question Answering (VTQA)

    A new challenge named Visual Text Question Answering (VTQA) along with a corresponding dataset, which includes 23,781 questions based on 10,124 image-text pairs.
  • YFCC100M

    The dataset used in the paper is YFCC100M, a large-scale video dataset. The dataset is used for foreground and background patch extraction and object recognition tasks.
You can also access this registry using the API (see API Docs).