4 datasets found

Tags: Chinese Word Segmentation

  • CTB6

    CTB6 (Chinese Treebank 6.0) is a dataset used for Chinese word segmentation tasks.
  • MSR

    MSR is the Microsoft Research corpus from the SIGHAN 2005 bakeoff, a dataset used for Chinese word segmentation tasks.
  • PKU

    PKU is the Peking University corpus from the SIGHAN 2005 bakeoff, a dataset used for Chinese word segmentation tasks.
  • CoNLL03

    The CoNLL03 dataset is a low-resource named entity recognition dataset. The dataset contains 4 entity types: person, location, organization, and miscellaneous entities. The...
You can also access this registry using the API (see API Docs).