List of datasets used to study opinion mining, sentiment analysis, and opinion spam detection
Dataset of over 5.8 million Amazon product reviews (including information on product, rating, review text, and more)
Historical data from tennis matches. Includes match and tournament results (from 2000) and head-to-head betting odds (from 2001).
Historical data from football (soccer) matches. Includes results data (from 1993), in-depth match statistics (from 2000), and betting odds (from 2000).
List of data (including paid and free options) on various sports. Includes sections on game/match results, player data, and betting odds.
Dataset examining how European Union member states’ gender equality policies impact a number of areas (e.g., health, economics) from 2005-2015
Dataset on loans from Lending Club (a peer-to-peer loan service), extending back to 2007. Includes information on disbursed and rejected loans.
Dataset with approximately 30 years' worth of information about companies and trusts in 10 offshore countries, including officer information and more
Dataset of the famous Panama Papers with information on offshore accounts, companies, and trusts
Dataset from the U.S. Department of Education that includes various metrics on outcomes from degree-granting undergraduate institutions from 1996-2015, including student debt, college completion rates, job placement, and more