-
SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection
The SWSR dataset consists of two files: SexWeibo.csv and SexComment.csv, containing weibos (posts) and comments (replies) respectively. -
Domain-Specific Preference (DSP) set
We collect a domain-specific preference (DSP) dataset, which includes preferred responses for each given query from four practical domains. -
Amazon-Beauty/Amazon-Digital-Music
Amazon-Beauty/Amazon-Digital-Music is a subset of Amazon Product dataset. -
FIGER dataset
The FIGER dataset contains 2M data samples labeled with 113 types. -
OntoNotes dataset
The OntoNotes dataset contains 3.4M automatically labeled entity mentions for training and 11k manually annotated instances that are split into 8k for dev set and 2k for test set. -
Normality Test Dataset
The dataset used to evaluate the performance of the neural network for normality test. -
REFIND: Relation Extraction Financial Dataset
Relation extraction financial dataset -
Recipe1M+ Dataset
The Recipe1M+ dataset is a large collection of culinary recipes labeled in respective categories with extended named entities extracted from recipe descriptions. -
Assorted, Archetypal, and Annotated Two Million (3A2M) Cooking Recipe Dataset
The 3A2M dataset is a large collection of culinary recipes labeled in respective categories with extended named entities extracted from recipe descriptions. -
Assorted, Archetypal, and Annotated Two Million Extended (3A2M+) Cooking Reci...
The 3A2M+ dataset is a large collection of culinary recipes labeled in respective categories with extended named entities extracted from recipe descriptions. -
Audrey Dataset
The dataset used for training the models in the Audrey chatbot. -
Google Maps demonstration dataset
The dataset used in this paper is a large-scale demonstration dataset containing 110M and 10M training and validation samples, respectively.