Data Resources: Repositories and Lists
Here you'll find a list of all of the data resources that we classify as repositories or lists. A repository contains multiple separate datasets in a single interface, while a list is simply a series of links to repositories or individual datasets. (More information on our taxonomy of data resources can be found on our About Us page.)
A subset of useful information about each resource is included on this page, but more information (including links, example publications, and how to gain access to it) can be found by clicking on the resource's name.
Title | Type | Description | Applicable Fields |
---|---|---|---|
Inter-university Consortium for Political and Social Research (ICPSR) | Repository | Repository of physical and electronic data for various purposes, including storage for replication, dissemination, and preservation | political science, social sciences, economics |
Internet Archive | Repository | Repository of multimodal archived data, both captured from websites and uploaded individually. Includes a variety of data: websites, books, videos, audio, television, software, images, concerts, and collections. | all |
inventory.data.gov (US) | Repository | Repository of databases of government information on a wide range of topics | social sciences, law, health, public policy, culture |
Kaggle | Repository | Publicly available datasets (and, when available, related scripts) | all |
Learning, Recognition, & Surveillance Lab Downloads | List | List of datasets and code used for a variety of automatic classifications, including team behavior, consumer behavior, and face detection | computer vision, classification, psychology, teamwork, perception, motor behavior, motor control, affect, emotion, event recognition, action recognition |
LearnSphere | List | List of databases and repositories for learning-related research | learning science, psychology, educational psychology |
Linguistic Data Consortium | Repository | Repository of spoken and text corpora in multiple languages (including Arabic, English, German, Japanese, Mandarin, Spanish, and more) | communication, linguistics, language, cross-cultural analysis, linguistic variation |
List of databases for machine learning research (Wikipedia) | List | List of datasets divided by data type: image data, text data, sound data, signal data, physical data, biological data, anomaly data, and multivariate data. Intended to support machine learning research. | all |
Mendeley Data | Repository | Repository of "scientific research data" across a variety of fields for sharing and archiving data | all |
National Archive of Computerized Data on Aging | Repository | Repository of datasets related to aging | social psychology, economics, psychology, health psychology, aging |