Dataset - LDM

SOGAR: Self-supervised spatiotemporal attention-based social group activity r...

SOGAR: Self-supervised spatiotemporal attention-based social group activity recognition.
- Dataset
- JSON
Group activity recognition using self-supervised approach of spatiotemporal t...

Group activity recognition using self-supervised approach of spatiotemporal transformers.
- Dataset
- JSON
KITTI Odometry

The dataset used in the paper is a large-scale point cloud compression framework, which can organize sparse and unstructured point clouds in a memory-efficient way.
- Dataset
- JSON
KITTI2015

Self-supervised monocular depth estimation from monocular video sequences
- Dataset
- JSON
Divide-and-Rule: Self-Supervised Learning for Survival Analysis in Colorectal...

A self-supervised learning method for learning histopathological patterns within cancerous tissue regions.
- Dataset
- JSON
PointCAM

Self-supervised adversarial masking for 3D point cloud representation learning
- Dataset
- JSON
MiniVox

MiniVox is an automatic framework to transform any speaker-labelled dataset into a continuous speech datastream with episodically revealed label feedback.
- Dataset
- JSON
Analysing Discrete Self Supervised Speech Representation for Spoken Language ...

This work profoundly analyzes discrete self-supervised speech representations (units) through the eyes of Generative Spoken Language Modeling (GSLM).
- Dataset
- JSON
OctoPath test dataset

The dataset used for testing the OctoPath network, containing sequences of octrees and points from the reference path.
- Dataset
- JSON
OctoPath dataset

The dataset used for training the OctoPath network, containing sequences of octrees and points from the reference path.
- Dataset
- JSON
LeBenchmark7K

The LeBenchmark7K dataset is a self-supervised representation of French speech.
- Dataset
- JSON
DeepPS2

Photometric stereo with two images using a self-supervised deep learning framework called DeepPS2.
- Dataset
- JSON
PHIMO - Physics-Informed Deep Learning for Motion-Corrected Reconstruction of...

PHIMO, a physics-informed motion correction method tailored to quantitative MRI, which utilises information from the MR signal evolution to detect motion events with a...
- Dataset
- JSON
Self-supervised Relational RL with Independently Controllable Subgoals

The dataset used in the paper is a multi-object environment with a robotic arm and multiple objects to manipulate. The agent learns to control the objects independently and...
- Dataset
- JSON
SoundNet

The dataset is used for learning general and effective models for both audio and video analysis from self-supervised temporal synchronization.
- Dataset
- JSON
Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization

Learning to localize the sound source in videos without explicit annotations is a novel area of audio-visual research. Existing work in this area focuses on creating attention...
- Dataset
- JSON
Structural Deep Clustering Network

Clustering is a fundamental task in data analysis. Recently, deep clustering, which derives inspiration primarily from deep learning approaches, achieves state-of-the-art...
- Dataset
- JSON
Decoupled Contrastive Learning

Contrastive learning is one of the most successful paradigms for self-supervised learning (SSL). In a principled way, it considers two augmented views of the same image as...
- Dataset
- JSON
S3T: Self-supervised pre-training with Swin Transformer for music classification

Self-supervised pre-training method with Swin Transformer for music classification, leveraging massive unlabeled music data to improve the performance of music classification...
- Dataset
- JSON
Unsupervised Learning of Style-Aware Facial Animation from Real Acting Perfor...

A new approach for creating an animatable and photo-realistic 3D head model from multi-view video footage of a real actor, together with a neural animation model based on...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

48 datasets found