Complete Public Reddit Comments Corpus (2007-2015)
Complete dataset of public comments posted to Reddit (http://www.reddit.com) comments from October 2007 to May 2015.
Complete dataset of public comments posted to Reddit (http://www.reddit.com) comments from October 2007 to May 2015.
Repository of televised news, including (for many) captions and rough statistics for content
Unstructured dataset of open-source media articles
Dataset of 8 million annotated YouTube videos, including a variety of audio and visual features.
Dataset of over 13,000 images of faces (labeled with names) taken from the internet, including over 1,600 people with multiple pictures
Datasets for training affect recognition and for perception studies
Data on international student educational assessments, including information on academic performance and learning-related factors
Datasets collected from free online personality tests
Nationwide data on education surveys, including information on academic performance and learning-related factors
Summary statistics and tables on nationwide data on education surveys, including information on academic performance and learning-related factors