2 datasets found
Tags: text analysis

The Pile
The Pile dataset contains 3.5 million samples of diverse text for language modeling.
Dataset (JSON)

Twitter Dataset
The Twitter Dataset is a collection of tweets in three languages (English, Dutch, and German), annotated with Plutchik's emotions.
Dataset (JSON)