-
Deep Compositional Robotic Planners
A dataset for training a compositional hierarchical recurrent network to follow natural language commands in continuous environments. -
MS MARCO: A Human-Generated Machine Reading Comprehension Dataset
The dataset is used for training and evaluating the MS MARCO model, a question answering model. -
Photorealistic text-to-image diffusion models with deep language understanding
The authors present a photorealistic text-to-image diffusion model with deep language understanding. -
Google Speech Commands Dataset
The Google Speech Commands Dataset contains 64,727 one-second-long utterance files which are recorded and labeled with one of 30 target categories. -
Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
Keyword spotting (KWS) plays a critical role in enabling speech-based user interactions on smart devices. Recent developments in the field of deep learning have led to wide... -
Wiki-40B, PG-19, C4, etc.
The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used various benchmarks such as Wiki-40B, PG-19, C4, etc. -
RoentGen: Vision-Language Foundation Model for Chest X-ray Generation
Multimodal models trained on large natural image-text pair datasets have exhibited astounding abilities in gener-ating high-quality images. Medical imaging data is fundamentally... -
Stanford Alpaca
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used CIFAR-10 and CIFAR-100 datasets for image classification, and ImageNet-100... -
Cross-View Training
The dataset used in the paper for semi-supervised sequence modeling with cross-view training. -
MISMATCH: Fine-grained Evaluation of Machine-generated Text
The dataset used in the paper for fine-grained evaluation of machine-generated text with mismatch error types. -
PhotoBot: Reference-Guided Interactive Photography via Natural Language
PhotoBot is a framework for fully automated photo acquisition based on an interplay between high-level human language guidance and a robot photographer. -
FairytaleQA
The FairytaleQA dataset is a collection of open-source fairy tales downloaded from Project Gutenberg. The dataset contains 278 fairy tales with a total of 33,577 events... -
Chinese Poetry
The Chinese Poetry dataset is a dataset of Chinese poems used for language modeling. -
Switchboard
Human speech data comprises a rich set of domain factors such as accent, syntactic and semantic variety, or acoustic environment.