No Organization - Organizations

Yahoo and Yelp corpora

The Yahoo and Yelp corpora dataset contains 100k sentences with greater average length.

Dataset
JSON

Youtube-VIS 2019

Unsupervised video object segmentation has made significant progress in recent years, but the manual annotation of video mask datasets is expensive and limits the diversity of...

Dataset
JSON

DAVIS2017-unsupervised

Video object segmentation is a crucial task in computer vision that involves segmenting primary objects in a video sequence.

Dataset
JSON

UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via S...

Unsupervised video object segmentation has made significant progress in recent years, but the manual annotation of video mask datasets is expensive and limits the diversity of...

Dataset
JSON

LV intraventricular septum (IVS), internal diameter (LVID), and posterior wal...

LV intraventricular septum (IVS), internal diameter (LVID), and posterior wall (LVPW) dimensions were annotated in parasternal long axis 2DE scans.

Dataset
JSON

PubMed Central Open Access Subset

PubMed Central Open Access Subset is a collection of biomedical papers.

Dataset
JSON

PMC-CLIP

PMC-CLIP: Contrastive language-image pre-training using biomedical documents.

Dataset
JSON

BioMedClip

BioMedClip: A CLIP model pretrained on image-text pairs extracted from PubMed Central repository.

Dataset
JSON

Training CLIP models on Data from Scientific Papers

Contrastive Language-Image Pretraining (CLIP) models are trained with datasets extracted from web crawls, which are of large quantity but limited quality. This paper explores...

Dataset
JSON

MUSES

The MUSES dataset is a collection of 3,697 videos, with 2,587 for training and 1,110 for testing.

Dataset
JSON

MultiTHUMOS

Temporal action localization (TAL) is a prevailing task due to its great application potential. Existing works in this field mainly suffer from two weaknesses: (1) They often...

Dataset
JSON

TemporalMaxer: Maximize Temporal Context with only Max Pooling

Temporal action localization (TAL) is a challenging task in video understanding that aims to identify and localize actions within a video sequence.

Dataset
JSON

Swedish trafﬁc-sign dataset (STSD)

The Swedish trafﬁc-sign dataset (STSD) contains 10 categories of trafﬁc signs.

Dataset
JSON

DFG trafﬁc-sign dataset

The DFG trafﬁc-sign dataset consists of 200 categories including large number of trafﬁc signs with high intra-category appearance variations.

Dataset
JSON

LSUN Churches

The dataset used for training and testing the Conditionally-Independent Pixel Synthesis (CIPS) generator.

Dataset
JSON

Flickr-Faces-HQ

Flickr-Faces-HQ contains 70,000 face images at 1024 × 1024 resolution, which were originally crawled from Flickr, manually checked to discard low-quality samples, and then...