2 datasets found
Groups: Natural Language Processing
Organizations: No Organization

BERT
The dataset used in this paper is a pre-trained BERT model trained on English Wikipedia and Books datasets.

BERT: Pre-training of deep bidirectional transformers for language understanding
This paper proposes BERT, a pre-trained deep bidirectional transformer for language understanding.