You're currently viewing an old version of this dataset. To see the current version, click here.

Twitter OOV Word Dataset

The dataset is a collection of Twitter tweets, filtered to include only English language tweets. The dataset is used to study out-of-vocabulary (OOV) words in Twitter.

Data and Resources

Cite this as

Suman Kalyan Maity, Chaitanya Sarda, Anshit Chaudhary, Abhijeet Patil, Shraman Kumar, Akash Mondal, Animesh Mukherjee (2024). Dataset: Twitter OOV Word Dataset. https://doi.org/10.57702/6ilbd3tw

DOI retrieved: December 17, 2024

Additional Info

Field Value
Created December 17, 2024
Last update December 17, 2024
Defined In https://doi.org/10.48550/arXiv.1602.00293
Author Suman Kalyan Maity
More Authors
Chaitanya Sarda
Anshit Chaudhary
Abhijeet Patil
Shraman Kumar
Akash Mondal
Animesh Mukherjee
Homepage https://dx.doi.org/10.1145/2818052.2869110