-
Synthetic two-dimensional data and MNIST digits
The dataset used in the experiments with the synthetic two-dimensional data and the MNIST digits. -
Paired-Embedding (PE) method for data augmentation and Act2Act network
Paired-Embedding (PE) method for effective and reliable data augmentation, Act2Act network learns from augmented data -
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instan...
MosaicFusion: A simple yet effective diffusion-based data augmentation approach for large vocabulary instance segmentation. -
Semantic Diffusion Model
The Semantic Diffusion Model (SDM) is used for synthesizing pulmonary CT images from segmentation maps for data augmentation. -
SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup
Active learning is an important technique for low-resource sequence labeling tasks. However, current active sequence labeling methods use the queried samples alone in each... -
Contextual augmentation: Data augmentation by words with paradigmatic relations
Contextual augmentation: Data augmentation by words with paradigmatic relations. -
RCT: RANDOM CONSISTENCY TRAINING FOR SEMI-SUPERVISED SOUND EVENT DETECTION
Sound event detection (SED) aims to detect sound events within an audio stream by labeling the events as well as their corresponding occurrence timestamps. -
Boomerang: Local sampling on image manifolds using diffusion models
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used it for data anonymization, data augmentation, and image perceptual quality... -
Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormalit...
A two-stage cytopathological image synthesis framework for augmenting cervical abnormality screening. -
Dirty Cityscapes
A dataset of 10k images with artificially generated soiling patterns, used for training and testing the soiling detection model. -
Dirty WoodScape
A companion dataset to the WoodScape dataset, containing 10k images with artificially generated soiling patterns. -
Generating Photo-Realistic Training Data to Improve Face Recognition Accuracy
Face datasets are widely used in face recognition tasks, but they are often limited by their size and quality. This paper proposes a method for generating photo-realistic... -
ND-MLS dataset
The bottle, grass, cat, and horse datasets were created for semantic segmentation tasks. The datasets contain images of 4 object types. The ND-MLS dataset was evaluated on... -
Omniglot dataset
The Omniglot dataset consists of 100 classes, each containing 20 images. Ten images were taken from each class for augmentation, and the rest were used as the test set. Each... -
Dataset for Image-to-Image Translation for Semantic Segmentation
The dataset used for the experiments with the proposed approach to augment image data for semantic segmentation networks by applying image-to-image translation with both, a... -
Cross Domain Generative Augmentation
The authors propose a novel data augmentation method called Cross Domain Generative Augmentation (CDGA) to reduce the estimation error of Empirical Risk Minimization (ERM) under... -
Mono-ViFI: A Unified Framework for Self-supervised Monocular Depth Estimation
Self-supervised monocular depth estimation has gathered no-table interest since it can liberate training from dependency on depth annotations. In monocular video training case,... -
MixGen: A New Multi-Modal Data Augmentation
MixGen: a joint data augmentation for vision-language representation learning to further improve data efficiency.