Data

There are a ton of places to find data related to whatever you want. The ones below are some of the more larger repositories:

  • Data World: A plethora of open data sets to peruse. If you are a data fiend, consider collaborating to solve problems.

  • Data is Plural newsletter: Jeremy Singer-Vine sends a weekly newsletter of the most interesting public datasets he’s found. You should subscribe to it. He also has an archive of all the datasets he’s highlighted.

  • Google Dataset Search: Google indexes thousands of public datasets; search for them here.

  • Kaggle: Kaggle hosts machine learning competitions where people compete to create the fastest, most efficient, most predictive algorithms. A byproduct of these competitions is a host of fascinating datasets that are generally free and open to the public.

  • US City Open Data Census: More than 100 US cities have committed to sharing dozens of types of data, including data about crime, budgets, campaign finance, lobbying, transit, and zoning. This site from the Sunlight Foundation and Code for America collects this data and rates cities by how well they’re doing.