Text8

Text8 is a text corpus commonly used to train and evaluate word embedding models such as Word2Vec, a distributed word embedding generator that uses an artificial neural network to learn dense vector representations of words.

Cite this as

Guy Tevet, Gavriel Habib, Vered Shwartz, Jonathan Berant (2024). Dataset: Text8. https://doi.org/10.57702/23zb3ic6

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.1611.07065
Citation
  • https://doi.org/10.48550/arXiv.2008.07720
  • https://doi.org/10.48550/arXiv.1711.04755
  • https://doi.org/10.1145/3447818.3460373
  • https://doi.org/10.48550/arXiv.1612.00584
  • https://doi.org/10.48550/arXiv.1809.03702
  • https://doi.org/10.48550/arXiv.1810.12686
  • https://doi.org/10.48550/arXiv.2405.16441
Author Guy Tevet
More Authors Gavriel Habib, Vered Shwartz, Jonathan Berant
Homepage https://mattmahoney.net/dc/textdata