Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 2 datasets found Groups: Text Corpus Organizations: No Organization Filter Results Text8 Word2Vec is a distributed word embedding generator that uses an artificial neural network to learn dense vector representations of words. Dataset JSON BookCorpus The dataset used in this paper for unsupervised sentence representation learning, consisting of paragraphs from unlabeled text. Dataset JSON