Skip to content

Learning

Rocio Ng edited this page Mar 3, 2017 · 62 revisions

Data Science Educational Resources

NOTE: If you would like to suggest a resource to be added to this page please send a DM to Rocio on Slack with a link to the tool and a short description.

  • Please limit suggestions to resources you have had some experience using
  • If there is a specific topic you would like to see covered here please let her know as well. This page is a work in progress!

General

R

  • swirl: Learn R, in R
    • Neat interactive way to learn how to do many things in R
  • R for Cats
    • Being a cat/cat-lover not a strict requirement. Introduction to R for complete beginners
  • Data Science with R
    • Companion Website to the Book with the same name
  • Stat 133 at Berkeley
  • Advanced R Hadley Wickham's interactive companion for his Advanced R book. If you want to learn how the language actually works (including its weirdo 3-teired OO framework and what's really going on with its lazy evaluation), go here. The metaprogramming aspects are extremely useful (code that writes code!).

Python

Books

  • Python for Data Analysis by Wes Mckinney
    • Great resource for learning how to work with the data analysis tools in Python

Data Processing and Cleaning

  • [Getting and Cleaning Data] (https://www.coursera.org/learn/data-cleaning)
    • Important Class. Even if you don't code in R you can follow the examples and understand the proper methods and BEST PRACTICES for data processing
  • Introduction to reshape2
    • reshape2 is one of the most used packages for transforming data between long and wide formats. Useful to follow along to understand some fundamentals of data transformations

Machine Learning

Books

  • Introduction to Statistical Learning by by Trevor Hastie and Rob Tibshirani - free e-book
    • Must-read for anyone interested in Machine Learning and aimed at a less mathematically inclined audience
    • Has useful labs that can be completed in R
  • [Elements of Statistical Learning] (http://statweb.stanford.edu/~tibs/ElemStatLearn/) by Trevor Hastie, Robert Tibshirani and Jerome Friedman - free e-book
    • For anyone who wants an advanced treatment of machine learning models and methods

Articles/Blog Posts

Probability & Statistics

SQL

  • Extensive interactive tutorials

Natural Language Processing

Misc

Useful Blogs/Sites to follow