Self-Supervised Learning - Groups

DINOv2

The dataset used in the paper is DINOv2, a vision foundation model trained on a large-scale dataset.

Dataset
JSON

OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used the ImageNet, Places205, and VOC07 datasets for evaluation.

Dataset
JSON

EquiMod: An Equivariance Module to Improve Visual Instance Discrimination

Recent self-supervised visual representation methods are closing the gap with supervised learning performance. Most of these successful methods rely on maximizing the similarity...

Dataset
JSON

MST: Masked Self-Supervised Transformer for Visual Representation

The proposed method is a self-supervised learning approach for visual representation learning, which can explicitly capture the local context of an image while preserving the...

Dataset
JSON

MOCA: Masked Online Codebook Assignments prediction

Self-supervised representation learning for Vision Transformers (ViT) to mitigate the greedy needs of ViT networks for very large fully-annotated datasets.

Dataset
JSON

5 datasets found

DINOv2

OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning

EquiMod: An Equivariance Module to Improve Visual Instance Discrimination

MST: Masked Self-Supervised Transformer for Visual Representation

MOCA: Masked Online Codebook Assignments prediction