Skip to content

Learning

Rocio Ng edited this page Aug 23, 2017 · 62 revisions

Data Science Educational Resources

NOTE: If you would like to suggest a resource to be added to this page please send a DM to Rocio on Slack with a link to the tool and a short description.

  • Please limit suggestions to resources you have had some experience using
  • If there is a specific topic you would like to see covered here please let her know as well. This page is a work in progress!

General

  • Code Academy
    • Free site for interactively learning to code in many languages (eg. Python, HTML, SQL, Git etc)
    • Highly recommended for absolute beginners to intermediate
  • Learn DataSci.com
    • Comprehensive list of online data science courses
  • Ranking of Intro Data Science Courses
    • Pretty extensive and methodical approach to ranking courses. Definitely check out to find a course that is right for you!

R

  • swirl: Learn R, in R
    • Neat interactive way to learn how to do many things in R
  • R for Cats
    • Being a cat/cat-lover not a strict requirement. Introduction to R for complete beginners
  • Data Science with R
    • Companion Website to the Book with the same name
  • Stat 133 at Berkeley
  • Advanced R Hadley Wickham's interactive companion for his Advanced R book. If you want to learn how the language actually works (including its weirdo 3-teired OO framework and what's really going on with its lazy evaluation), go here. The metaprogramming aspects are extremely useful (code that writes code!).

Python

Books

  • Python for Data Analysis by Wes Mckinney
    • Great resource for learning how to work with the data analysis tools in Python

Data Processing and Cleaning

  • Getting and Cleaning Data
    • Important Class. Even if you don't code in R you can follow the examples and understand the proper methods and BEST PRACTICES for data processing
  • Introduction to reshape2
    • reshape2 is one of the most used packages for transforming data between long and wide formats. Useful to follow along to understand some fundamentals of data transformations

Machine Learning

Books

  • Introduction to Statistical Learning by by Trevor Hastie and Rob Tibshirani - free e-book
    • Must-read for anyone interested in Machine Learning and aimed at a less mathematically inclined audience
    • Has useful labs that can be completed in R
  • Elements of Statistical Learning by Trevor Hastie, Robert Tibshirani and Jerome Friedman - free e-book
    • For anyone who wants an advanced treatment of machine learning models and methods

Articles/Blog Posts

Probability & Statistics

SQL

  • Extensive interactive tutorials

Natural Language Processing

Spark

Deep Learning

  • Deep Learning
    • Free Online Textbook by Ian Goodfellow, Yoshua Bengio, Aaron Courville
    • Intro Chapters also cover a lot of the math needed for Deep Learning Models

Misc

Useful Blogs/Sites to follow