Complete Public Reddit Comments Corpus (2007-2015)
Complete dataset of public comments posted to Reddit (http://www.reddit.com) comments from October 2007 to May 2015.
Complete dataset of public comments posted to Reddit (http://www.reddit.com) comments from October 2007 to May 2015.
Repository of televised news, including (for many) captions and rough statistics for content
Dataset of 8 million annotated YouTube videos, including a variety of audio and visual features.
List of datasets used to study opinion mining, sentiment analysis, and opinion spam detection
Dataset of over 5.8 million Amazon product reviews (including information on product, rating, review text, and more)
List of freely available datasets with up to 18 public metrics of cities within the U.S. (e.g., crime, zoning, health inspections, transit)
List of freely available datasets about police-citizen interactions within U.S. cities
Curated repository of datasets about consumer energy behavior
Repository of multimodal data on child language and communication (subset of TalkBank)
Google search data, available for download