Big Data Open Source Software Projects
latest

Course

  • Syllabus
  • FAQ
  • Homework
  • Using GitLab
  • Software Projects

Python for Big Data

  • Introduction to Python
  • Python for Big Data

Useful Tools and Tips

  • Datasets
  • Reference Managers
  • Using SSH Keys
  • Links Report

APPENDIX

  • Homework References

Drafts(TODO)

  • Drafts (TODO)

Contributing

  • Todos
  • Changelog
Big Data Open Source Software Projects
  • Docs »
  • Datasets
  • Edit on GitHub

DatasetsΒΆ

Below are links to collections of datasets that may be of use for homework assignments or projects.

  • https://www.data.gov/
  • https://github.com/caesar0301/awesome-public-datasets
  • https://aws.amazon.com/public-data-sets/
  • https://www.kaggle.com/datasets
  • https://cloud.google.com/bigquery/public-data/github
  • https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public

For NIST Projects:

  • NIST Special Database 27A [4GB]
  • INRIA Person Dataset
  • Healthcare data from CMS
  • Uber Ride Sharing GPS Data
  • Census Data
Next Previous

© Copyright 2016, Gregor von Laszewski, Badi Abdul-Wahid. Revision 881c407c.

Built with Sphinx using a theme provided by Read the Docs.