Data Resources: Repositories and Lists

Here you'll find a list of all of the data resources that we classify as repositories or lists. A repository contains multiple separate datasets in a single interface, while a list is simply a series of links to repositories or individual datasets. (More information on our taxonomy of data resources can be found on our About Us page.)

This page includes a subset of useful information about each resource; more information (including links, example publications, and how to gain access to the resource) can be found by clicking on the resource's name.

| Title | Type | Description | Applicable Fields |
| --- | --- | --- | --- |
| Inter-university Consortium for Political and Social Research (ICPSR) | Repository | Repository of physical and electronic data for various purposes, including storage for replication, dissemination, and preservation | political science, social sciences, economics |
| Internet Archive | Repository | Repository of multimodal data archived from websites or uploaded individually. Includes a variety of data: websites, books, videos, audio, television, software, images, concerts, and collections. | all |
| inventory.data.gov (US) | Repository | Repository of government databases covering a wide range of topics | social sciences, law, health, public policy, culture |
| Kaggle | Repository | Publicly available datasets (and, when available, related scripts) | all |
| Learning, Recognition, & Surveillance Lab Downloads | List | List of datasets and code used for a variety of automatic classifications, including team behavior, consumer behavior, and face detection | computer vision, classification, psychology, teamwork, perception, motor behavior, motor control, affect, emotion, event recognition, action recognition |
| LearnSphere | List | List of databases and repositories for learning-related research | learning science, psychology, educational psychology |
| Linguistic Data Consortium | Repository | Repository of spoken and text corpora in multiple languages (including Arabic, English, German, Japanese, Mandarin, Spanish, and more) | communication, linguistics, language, cross-cultural analysis, linguistic variation |
| List of databases for machine learning research (Wikipedia) | List | List of datasets divided by data type: image data, text data, sound data, signal data, physical data, biological data, anomaly data, and multivariate data. Intended to support machine learning research. | all |
| Mendeley Data | Repository | Repository of "scientific research data" across a variety of fields for sharing and archiving data | all |
| National Archive of Computerized Data on Aging | Repository | Repository of datasets related to aging | social psychology, economics, psychology, health psychology, aging |
