Repository of televised news, including (for many) captions and rough statistics for content
Unstructured dataset of open-source media articles
Dataset of 8 million annotated YouTube videos, including a variety of audio and visual features.
Transcripts from British speeches (1895 - 2015), categorized by date, speaker, party, and title
Data from audio recordings of human interaction across various regions of the United States and including a variety of speakers and contexts