Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 2 datasets found Organizations: No Organization Formats: JSON Filter Results Text8 Word2Vec is a distributed word embedding generator that uses an artificial neural network to learn dense vector representations of words. Dataset JSON Penn Treebank The Penn Treebank dataset contains one million words of 1989 Wall Street Journal material annotated in Treebank II style, with 42k sentences of varying lengths. Dataset JSON