6 datasets found

Tags: Named entity recognition

Filter Results
  • VLSP

    The dataset used for part-of-speech tagging and named entity recognition tasks.
  • DocRED dataset

    The DocRED dataset was built from Wikipedia and Wikidata, covering various relations related to science, art, personal life, etc.
  • Rare-NER, Bio-NER, and Twitter-POS datasets

    The Rare-NER, Bio-NER, and Twitter-POS datasets are used for named entity recognition and part-of-speech tagging.
  • Wall Street Journal

    The Wall Street Journal dataset is used for syntactic linearization. It contains a large corpus of news articles with their corresponding syntactic trees.
  • CMeEE

    The CMeEE V1 and V2 datasets for Chinese nested medical NER.
  • Wiki-40B, PG-19, C4, etc.

    The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used various benchmarks such as Wiki-40B, PG-19, C4, etc.
You can also access this registry using the API (see API Docs).