-
LibriLight: A Benchmark for ASR with Limited or No Supervision
The LibriLight dataset is a large-scale speech corpus used for self-supervised speech recognition tasks. -
EAT: Enhanced ASR-TTS for Self-Supervised Speech Recognition
Self-supervised ASR-TTS models suffer in out-of-domain data conditions. Here we propose an enhanced ASR-TTS model that incorporates two main features: 1) The ASR→TTS direction... -
Prototypical Contrastive Learning
Self-supervised learning of pretext-invariant representations. -
Geography-aware self-supervised learning
Geography-aware self-supervised learning -
T-Drive: Driving Directions Based on Taxi Trajectories
T-Drive dataset for learning task-agnostic feature representations of geographic locations from unlabeled GPS trajectories -
Probe Data and Privacy
Probe data for learning task-agnostic feature representations of geographic locations from unlabeled GPS trajectories -
Reachability Embeddings: Scalable Self-Supervised Representation Learning fro...
GPS trajectory dataset for learning task-agnostic feature representations of geographic locations from unlabeled GPS trajectories -
OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used the ImageNet, Places205, and VOC07 datasets for evaluation. -
Self-Distillation Prototypes Network: Learning Robust Speaker Representations...
Training speaker-discriminative and robust speaker verification systems without explicit speaker labels remains a persisting challenge. In this paper, we propose a new... -
A simple data mixing prior for improving self-supervised learning
A simple data mixing prior for improving self-supervised learning. -
EquiMod: An Equivariance Module to Improve Visual Instance Discrimination
Recent self-supervised visual representation methods are closing the gap with supervised learning performance. Most of these successful methods rely on maximizing the similarity... -
MST: Masked Self-Supervised Transformer for Visual Representation
The proposed method is a self-supervised learning approach for visual representation learning, which can explicitly capture the local context of an image while preserving the... -
SSL4EO-S12
SSL4EO-S12: A large-scale, globally distributed, multi-temporal and multi-sensor dataset for self-supervised learning in Earth observation. -
S5Mars: Self-supervised and semi-supervised learning for Mars segmentation
A self-supervised and semi-supervised learning for Mars segmentation dataset. -
MOCA: Masked Online Codebook Assignments prediction
Self-supervised representation learning for Vision Transformers (ViT) to mitigate the greedy needs of ViT networks for very large fully-annotated datasets.