Amazon Corpus

The Amazon corpus contains user product reviews. Because the review text is noisy, its vocabulary is much larger relative to the number of documents.
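As a rough illustration of what "vocabulary relative to the number of documents" can mean, the sketch below counts distinct tokens and divides by the document count. The function name `vocab_per_document`, the regex tokenizer, and the three sample reviews are all hypothetical and chosen only for illustration; they are not taken from the corpus or from the paper that defines it, which may measure vocabulary differently.

```python
import re
from typing import Iterable


def vocab_per_document(docs: Iterable[str]) -> float:
    """Return the number of distinct tokens divided by the number of documents.

    Tokenization here is an assumption: lowercase the text, then keep runs
    of letters, digits, and apostrophes.
    """
    vocab = set()
    n_docs = 0
    for text in docs:
        n_docs += 1
        vocab.update(re.findall(r"[a-z0-9']+", text.lower()))
    if n_docs == 0:
        raise ValueError("need at least one document")
    return len(vocab) / n_docs


# Made-up reviews: the second one mimics noisy spelling, so every one of
# its tokens is a new vocabulary entry.
reviews = [
    "Great product, works great!",
    "gr8 prodct, wrks grt",
    "Works as described.",
]
ratio = vocab_per_document(reviews)
print(f"distinct tokens per document: {ratio:.2f}")
```

Misspellings and abbreviations each count as a separate token under this scheme, which is one plausible way noisy text can inflate vocabulary size.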

Cite this as

Wilson Fearn, Orion Weller, Kevin Seppi (2025). Dataset: Amazon Corpus. https://doi.org/10.57702/ti3v8v42

DOI retrieved: January 3, 2025

Additional Info

Field          Value
Created        January 3, 2025
Last update    January 3, 2025
Defined In     https://doi.org/10.48550/arXiv.2104.03848
Author         Wilson Fearn
More Authors   Orion Weller, Kevin Seppi
Homepage       https://jmcauley.ucsd.edu/data/amazon/