Complete Public Reddit Comments Corpus (2007-2015)
Complete dataset of public comments posted to Reddit (http://www.reddit.com) comments from October 2007 to May 2015.
Complete dataset of public comments posted to Reddit (http://www.reddit.com) comments from October 2007 to May 2015.
Repository of televised news, including (for many) captions and rough statistics for content
Unstructured dataset of open-source media articles
List of datasets used to study opinion mining, sentiment analysis, and opinion spam detection
Dataset of over 5.8 million Amazon product reviews (including information on product, rating, review text, and more)
Dataset on more than 1800 U.S. criminal conviction exonerations (beginning in 1989), including information on the individual exoneree and their case
Dataset of information derived from and related to one million contemporary songs, with more than 50 variables (including information on track metadata, social networks, and more)
Dataset of information about all of the papers presented at the 2015 Neural Information Processing Systems (NIPS) Conference
API for current and historical flight information, including flight path, weather, aircraft type, airport details, connections, and more
API for current and (recent) historical flight information, including flight path, speed, aircraft type, airport details, and more