-
CONLL 2002
The dataset used for evaluation of the proposed model. -
I2B2 2009 Medical Information Extraction Challenge
Named Entity Recognition in Electronic Health Records using Transfer Learning Bootstrapped Neural Networks -
ClubFloyd dataset
The ClubFloyd dataset is a collection of human transcripts of text-based games, used to train action candidate generators. -
Global pointer: Novel efficient span-based approach for named entity recognition
Global pointer: Novel efficient span-based approach for named entity recognition. -
Twitter Name Tagging (TNT) and Broad Twitter Corpus (BTC)
Twitter Name Tagging (TNT) and Broad Twitter Corpus (BTC) datasets are used for named entity recognition in social media. -
Chinese OntoNotes v5.0
This dataset is used for Named Entity Recognition (NER) tasks. -
LaptopReview dataset
The LaptopReview dataset contains 3,012 mentions to laptop features. -
CoNLL 2003 dataset
The CoNLL 2003 dataset is a collection of news-wire articles used for sequence labeling tasks. -
Seungjeongwon Corpus
The Seungjeongwon corpus is a historical corpus that contains the diary of a royal secretary from the Joseon Dynasty, with annotated named entities and punctuation markers.