Data Resources: Datasets

This page lists datasets: collections of data on a single topic. (More information on our taxonomy of data resources can be found on our About Us page.)

A subset of useful information about each resource is included on this page, but more information (including links, example publications, and how to gain access to it) can be found by clicking on the resource's name.

| Title | Type | Description | Applicable Fields |
| --- | --- | --- | --- |
| Lending Club Statistics | Dataset | Dataset on loans from Lending Club (a peer-to-peer loan service), extending back to 2007. Includes information on disbursed and rejected loans. | risk, decision making, economics, consumer behavior, classification, behavior trends, public policy |
| MaxMind GeoIP2 City Database | Dataset | Physical location data on IPv4 and IPv6 addresses worldwide | classification, categorization, social psychology |
| Metro Transit Authority Turnstile Data | Dataset | List of turnstile usage data for metro stations in New York City | public policy, behavior change, behavior trends, economics, exploration, network analysis |
| Million Song Dataset | Dataset | Dataset of information derived from and related to one million contemporary songs, with more than 50 variables (including information on track metadata, social networks, and more) | music, tagging, categorization, language, network analysis, language use, group behavior, exploration, perception |
| Moss Psycholinguistic Project Database | Dataset | Database on picture naming and word repetition in people with aphasia | language, language production, language comprehension, aging, clinical psychology, neuroscience |
| MusicBrainz | Dataset | Open encyclopedia of music metadata (also including information on users' editing behavior) | tagging, categorization, social trends, expertise, decision making, search, imitation, exploration |
| NAEP Data Explorer | Dataset | Summary statistics and tables on nationwide data on education surveys, including information on academic performance and learning-related factors | developmental psychology, culture, education, individual differences |
| Name Distributions in the Social Security Area, August 1997 (US) | Dataset | List of most popular baby names from 1900 to 1997 | culturomics, culture, social trends |
| National Longitudinal Study of Adolescent to Adult Health (Add Health) | Dataset | Longitudinal survey datasets (including public-use and restricted datasets) with data on social, economic, psychological and physical well-being | developmental psychology, health psychology, social sciences |
| NBA Player Tracking Data (Scraper) | Dataset | Database of detailed NBA player statistics (including shot type, location on the court, and success), scraped using R and Python | expertise, decision making, sports psychology, practice effects, stress |
