Complete Public Reddit Comments Corpus (2007-2015)
Complete dataset of public comments posted to Reddit (http://www.reddit.com) comments from October 2007 to May 2015.
Complete dataset of public comments posted to Reddit (http://www.reddit.com) comments from October 2007 to May 2015.
Repository of televised news, including (for many) captions and rough statistics for content
Unstructured dataset of open-source media articles
Dataset of 8 million annotated YouTube videos, including a variety of audio and visual features.
Historical data from tennis matches. Includes match and tournament results (from 2000) and head-to-head betting odds (from 2001).
Historical data from football (soccer) matches. Includes results data (from 1993), in-depth match statistics (from 2000), and betting odds (from 2000).
List of data (including paid and free options) on various sports. Includes sections on game/match results, player data, and betting odds.
Repository of multimodal data on human and animal communication
Repository of data for network analyses, including
Datasets on various socioeconomic issues within the USA (including poverty, wealth, and employment); dates vary by dataset