-
LAPTOP dataset
The LAPTOP dataset is used for aspect-based sentiment analysis, containing review sentences along with gold standard aspect sentiment annotations. -
ChnSentiCorp
ChnSentiCorp is a dataset used for sentiment classification in Chinese documents, where the text is classified into positive or negative labels. -
WMT19 News Translation Dataset
The dataset includes authentic parallel data with and without document boundaries, as well as back-translated data to enhance the training of document-level translation models. -
NIST Chinese-English Test Dataset
NIST test sets used as evaluation benchmarks for Chinese to English translation performance. -
WMT14 English-French and English-German Dataset
WMT14 dataset consisting of English to French and English to German translations used as test sets for evaluating the robustness of the machine translation systems. -
Parallel Translation Dataset for NMT
The dataset includes parallel translation data used to train victim models for evaluating adversarial attacks in neural machine translation tasks. -
GOCS Technology for Geostationary Orbit Complex Satellite
This dataset pertains to geostationary orbit complex satellite technology, comprising valid patents that have undergone expert validation. -
MRRG Technology for Micro Radar Rain Gauge
This dataset includes technology focused on micro radar rain gauge systems, with a thorough filtering process to identify valid patents. -
1MWDFS Technology for 1MW Dual Frequency System
A dataset detailing the technology for 1MW dual frequency systems, containing valid patents that have been curated based on expert recommendations. -
MPUART Marine Plant Using Augmented Reality Technology
A dataset focused on marine plant technologies using augmented reality. It includes a comprehensive list of patents related to this technology, filtered for validity based on...