-
NeurIPS, AAN, NSF Abstracts
NeurIPS, AAN, NSF Abstracts -
Event Location Dataset
A dataset of around 8,000 labeled sentences in English, each of which is annotated with an event verb and its corresponding location or locations. -
FLICKR-25K
The dataset used for cross-modal hashing task, containing image and text data. -
MoleculeNet
The MoleculeNet dataset is a collection of molecular property prediction tasks. It contains 17 datasets, each with a different type of molecular graph. -
Trafficking-10k
The Trafficking-10k dataset contains more than 10,000 advertisements annotated for the task of detecting human trafficking. The dataset contains two sources of information per... -
ActivityNet Captions
The ActivityNet Captions is a benchmark dataset proposed for dense video captioning. There are 20K untrimmed videos in total, and each video has several annotated segments with... -
MMVet Dataset
The dataset used for testing the Vary-base model, containing MMVet dataset. -
DocVQA and ChartQA Datasets
The dataset used for testing the Vary-base model, containing DocVQA and ChartQA datasets. -
Document-Level OCR Dataset
The dataset used for testing the Vary-base model, containing document-level OCR test set. -
Natural Image-Text Dataset
The dataset used for training the Vary-base model, containing natural image-text pairs. -
Document and Chart Dataset
The dataset used for training the new vision vocabulary network, containing high-resolution document and chart images with corresponding text.