30 datasets found

Tags: text analysis

  • SemEval-2014 ABSA Restaurant Reviews

    The SemEval-2014 ABSA Restaurant Reviews dataset is a collection of restaurant reviews annotated for aspect-based sentiment analysis (ABSA).
  • Presidential Speech Dataset

    The Presidential Speech Dataset is a collection of speeches from 43 U.S. presidents, with each speech annotated with sentiment.
  • UKWaC and Wackypedia corpora

    A large text corpus compiled from the UKWaC and Wackypedia corpora.
  • Yelp Dataset

    The Yelp Dataset contains 1.6M reviews and 500K tips by 366K users for 61K businesses; 481K business attributes, such as hours, parking availability, ambience; and check-ins for...
  • Yelp Dataset Challenge

    The Yelp Dataset Challenge contains reviews and images of restaurants, with the goal of recommending images for each review.
  • Yelp Reviews

    Yelp Reviews is a large dataset of customer reviews.
  • Yahoo Answers

    The Yahoo Answers dataset contains 730,000 questions and answers.
  • Penn Treebank

    The Penn Treebank dataset contains one million words of 1989 Wall Street Journal material annotated in Treebank II style, with 42k sentences of varying lengths.
  • BookCorpus

    BookCorpus is a dataset for unsupervised sentence representation learning, consisting of paragraphs of unlabeled text.
  • MIMIC-III

    MIMIC-III (Medical Information Mart for Intensive Care III) is a large, publicly available clinical database. It is used for various clinical...
You can also access this registry using the API (see API Docs).