Repository of televised news, including (for many) captions and rough statistics for content
Dataset of 8 million annotated YouTube videos, including a variety of audio and visual features.
Dataset of annotated images (including a variety of objects and settings)
API for Flickr image service, allowing access to a wider range of Flickr data, including users, groups, photo comments, photo geolocation, and more. (See also the 100 Million Flickr Images project.)
Dataset of human-generated sketches of specific images, including information on the temporal order of each stroke in creating the sketch; also includes human and computer classifications of the sketches
Repository of three laboratory-collected datasets comprising 80,000 individual trials across three visual search tasks (classic feature search, classic conjunction search, spatial configuration search). Further detail available below.
Video data from head-mounted camera (first-person or egocentric perspective) during various tasks
Video data from head-mounted camera (first person or egocentric perspective) during activities at a theme park
Video data from head-mounted camera (first person or egocentric perspective) during interactions with inanimate objects, with some coupled eye tracking data