-
Aggrefact-Unified dataset
The Aggrefact-Unified dataset is a collection of news documents and summaries with factual errors. -
CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels
A large chest radiograph dataset with labels for common thorax diseases. -
ChestX-ray14: A Large Chest Radiograph Dataset with Uncertainty Labels
A large chest radiograph dataset with labels for common thorax diseases. -
Multi-Label Continual Learning for Medical Imaging: A Novel Benchmark
A novel benchmark for multi-label image classification in medical imaging, combining new classes and domains into a challenging scenario. -
SMM4H 2021 Task 2, Russian tweets
The dataset for SMM4H 2021 Task 2, Russian tweets, includes tweets that report adverse drug effects (ADEs) or drug reactions (ADRs). -
SMM4H 2021 Task 1a, English tweets
The dataset for SMM4H 2021 Task 1a, English tweets, includes tweets that report adverse drug effects (ADEs) or drug reactions (ADRs). -
SMM4H 2020 Task 1b, French tweets
The dataset for SMM4H 2020 Task 1b, French tweets, includes tweets that report adverse drug effects (ADEs) or drug reactions (ADRs). -
The techqa dataset
TechQA: a dataset for question answering on technical support articles -
VAULT: VAriable Unified Long Text representation for Machine Reading Comprehen...
VAULT: a light-weight and parallel-efficient paragraph representation for Machine Reading Comprehension (MRC) based on contextualized representation from long document input -
SParC and CoSQL
The dataset used in the paper is SParC and CoSQL, two large complex cross-domain context-dependent text-to-SQL datasets. -
Point Grey Bumblebee2 1394a1 and ZED Stereo Camera2
The Point Grey Bumblebee2 1394a1 and the ZED Stereo Camera2 are used to evaluate the proposed stereo matching method. -
Simulated dataset for HARQ feedback prediction
The dataset used in the paper is a simulated dataset obtained from stochastic channel models, which are widely used for performance evaluations of physical layer techniques. -
Yelp Weeplaces
Yelp Weeplaces dataset contains check-ins on POIs over all major cities in the United States, across various categories. -
GroupIM: A Mutual Information Maximization Framework for Neural Group Recomme...
We study the problem of making item recommendations to ephemeral groups, which comprise users who purchase very few (or no) items together. -
Satellite Depth Completion Dataset
A large-scale satellite depth completion dataset for training and testing spacecraft depth completion algorithms. -
WebWISE: Web Interface Control and Sequential Exploration with Large Language...
The paper investigates using a Large Language Model (LLM) to automatically perform web software tasks using click, scroll, and text input operations. -
Grasp dataset for robotic grasping
A dataset of successful, cylindrical precision robotic grasps using the V-REP simulator and object files provided by Kleinhans et al. on a simulated "picking" task. -
Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A S...
The dataset for the paper titled Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey -
SexHateLex
The SexHateLex lexicon is a large collection of sexist and abusive terms in Chinese.