No Organization - Organizations

SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection

The SWSR dataset consists of two files: SexWeibo.csv and SexComment.csv, containing weibos (posts) and comments (replies) respectively.
- Dataset
- JSON
Domain-Specific Preference (DSP) set

We collect a domain-specific preference (DSP) dataset, which includes preferred responses for each given query from four practical domains.
- Dataset
- JSON
CloCom

A dataset for automatic comment generation from source code.
- Dataset
- JSON
Boa

A dataset for automatic comment generation from source code.
- Dataset
- JSON
CODE-NN

A dataset for automatically generating summary comments for Java methods.
- Dataset
- JSON
C2CGit

A large dataset from open projects in GitHub, which is more than 20× larger than existing datasets.
- Dataset
- JSON
Youshu

The dataset Youshu is a bundle recommendation dataset, which contains 995 groups, 5,275 users, and 1,513 items. The dataset also includes 4,771 bundles, 32,770 user-bundle...
- Dataset
- JSON
Mafengwo

The dataset Mafengwo is a group recommendation dataset, which contains 995 groups, 5,275 users, and 1,513 items. The dataset also includes 4,771 bundles, 32,770 user-bundle...
- Dataset
- JSON
Huawei

Huawei is a music service system.
- Dataset
- JSON
Amazon-Beauty/Amazon-Digital-Music

Amazon-Beauty/Amazon-Digital-Music is a subset of Amazon Product dataset.
- Dataset
- JSON
MIND3

MIND3 is a large scale news recommendation dataset.
- Dataset
- JSON
FIGER dataset

The FIGER dataset contains 2M data samples labeled with 113 types.
- Dataset
- JSON
OntoNotes dataset

The OntoNotes dataset contains 3.4M automatically labeled entity mentions for training and 11k manually annotated instances that are split into 8k for dev set and 2k for test set.
- Dataset
- JSON
Normality Test Dataset

The dataset used to evaluate the performance of the neural network for normality test.
- Dataset
- JSON
REFIND: Relation Extraction Financial Dataset

Relation extraction financial dataset
- Dataset
- JSON
Recipe1M+ Dataset

The Recipe1M+ dataset is a large collection of culinary recipes labeled in respective categories with extended named entities extracted from recipe descriptions.
- Dataset
- JSON
Assorted, Archetypal, and Annotated Two Million (3A2M) Cooking Recipe Dataset

The 3A2M dataset is a large collection of culinary recipes labeled in respective categories with extended named entities extracted from recipe descriptions.
- Dataset
- JSON
Assorted, Archetypal, and Annotated Two Million Extended (3A2M+) Cooking Reci...

The 3A2M+ dataset is a large collection of culinary recipes labeled in respective categories with extended named entities extracted from recipe descriptions.
- Dataset
- JSON
Audrey Dataset

The dataset used for training the models in the Audrey chatbot.
- Dataset
- JSON
Google Maps demonstration dataset

The dataset used in this paper is a large-scale demonstration dataset containing 110M and 10M training and validation samples, respectively.
- Dataset
- JSON

244 datasets found