Dataset - LDM

Whole Heart 3D+T Representation Learning

A whole-heart self-supervised learning framework that utilizes masked imaging modeling to automatically uncover the correlations between spatial and temporal patches throughout...
- Dataset
- JSON
LibriLight: A Benchmark for ASR with Limited or No Supervision

The LibriLight dataset is a large-scale speech corpus used for self-supervised speech recognition tasks.
- Dataset
- JSON
Anomaly Detection in Video via Self-Supervised and Multi-Task Learning

Anomaly detection in video via self-supervised and multi-task learning
- Dataset
- JSON
MonoSelfRecon

MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views
- Dataset
- JSON
FlowStep3D: Model unrolling for self-supervised scene flow estimation.

Model unrolling for self-supervised scene flow estimation.
- Dataset
- JSON
Dyna-LfLH: Learning Agile Navigation in Dynamic Environments from Learned Hal...

Dyna-LfLH is a self-supervised machine learning method for mobile robot navigation in dynamic environments.
- Dataset
- JSON
Self-supervised temporal analysis of spatiotemporal data

Small, publicly available GPS trajectory datasets (e.g., [51, 55, 80]) have varying sampling rates with incomplete trajectories, (ii) are geographically incomplete, (iii) have...
- Dataset
- JSON
SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets

Geospatial datasets are diverse, naturally spatiotemporal, and inherently multimodal (composed of two or more distinct signal types or modalities) e.g., satellite/aerial imagery...
- Dataset
- JSON
EAT: Enhanced ASR-TTS for Self-Supervised Speech Recognition

Self-supervised ASR-TTS models suffer in out-of-domain data conditions. Here we propose an enhanced ASR-TTS model that incorporates two main features: 1) The ASR→TTS direction...
- Dataset
- JSON
Partial2Complete

Point cloud completion aims to recover the complete shape based on a partial observation. The proposed Partial2Complete (P2C) framework completes point cloud objects using...
- Dataset
- JSON
Zero-Shot Automatic Pronunciation Assessment

Automatic Pronunciation Assessment (APA) is vital for computer-assisted language learning. Prior methods rely on annotated speech-text data to train Automatic Speech Recognition...
- Dataset
- JSON
Prototypical Contrastive Learning

Self-supervised learning of pretext-invariant representations.
- Dataset
- JSON
S2R2

Self-supervised visual representation learning framework for ranking random image views.
- Dataset
- JSON
Geography-aware self-supervised learning

Geography-aware self-supervised learning
- Dataset
- JSON
Training transitive and commutative multimodal transformers with LoReTTa

Training transitive and commutative multimodal transformers with LoReTTa
- Dataset
- JSON
T-Drive: Driving Directions Based on Taxi Trajectories

T-Drive dataset for learning task-agnostic feature representations of geographic locations from unlabeled GPS trajectories
- Dataset
- JSON
Probe Data and Privacy

Probe data for learning task-agnostic feature representations of geographic locations from unlabeled GPS trajectories
- Dataset
- JSON
Reachability Embeddings: Scalable Self-Supervised Representation Learning fro...

GPS trajectory dataset for learning task-agnostic feature representations of geographic locations from unlabeled GPS trajectories
- Dataset
- JSON
OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used the ImageNet, Places205, and VOC07 datasets for evaluation.
- Dataset
- JSON
MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model

MonoDiffusion: A novel self-supervised monocular depth estimation framework by reformulating it as an iterative denoising process.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

30 datasets found