[Transcript] Clare Corthell
I couldn't wait to go back to grad school. Literally. So I designed my own grad school and spent 5 months learning & hacking in great delight!
My Background (linkedin)
I'm a Stanford-educated Engineer, previously a Front-End Developer and UX Designer on early-stage products. I'm always in hot pursuit of deeper insight into social questions!
Data Science is an ideal marriage of my technical skills, my social research interests, and my geekish-freakish love of statistics.
I'm now a Data Scientist with an incredible team at Mattermark!
- Intro to Data Science UW / Coursera
- Topics: Python NLP on Twitter API, Distributed Computing Paradigm, MapReduce/Hadoop & Pig Script, SQL/NoSQL, Relational Algebra, Experiment design, Statistics, Graphs, Amazon EC2, Visualization.
- Linear Algebra / Levandosky Stanford / Book
- Statistics Stats in a Nutshell / Book
- Problem-Solving Heuristics "How To Solve It" Polya / Book
- Algorithms
  - Algorithms Design & Analysis I Stanford / Coursera
  - Algorithm Design Kleinberg & Tardos / Book
- Databases
  - Introduction to Databases Stanford / Coursera
- Data Mining
  - Mining Massive Data Sets Stanford / Book
  - Mining The Social Web O'Reilly / Book
  - Introduction to Information Retrieval Stanford / Book
  - Data wrangling with MongoDB Udacity / Course
- Machine Learning
  - Machine Learning / Ng Stanford / Coursera
  - Programming Collective Intelligence O'Reilly / Book
  - Statistics The Elements of Statistical Learning / Book ** in progress
- Probabilistic Graphical Models
  - Probabilistic Programming and Bayesian Methods for Hackers [Github / Tutorials](https://github.com/CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers)
  - PGMs / Koller Stanford / Coursera ** in progress
- Natural Language Processing
  - NLP with Python O'Reilly / Book
- Analysis
  - Python for Data Analysis O'Reilly / Book
  - Big Data Analysis with Twitter UC Berkeley / Lectures
  - Social and Economic Networks: Models and Analysis / Stanford / Coursera
  - Information Visualization "Envisioning Information" Tufte / Book
- Python (Learning)
  - New To Python: Learn Python the Hard Way, Google's Python Class
- Python (Libraries)
  - Basic Packages | Python, virtualenv, NumPy, SciPy, matplotlib and IPython
  - Bayesian Inference | pymc
  - Labeled data structure objects, statistical functions, etc. | pandas (See: Python for Data Analysis)
  - Python wrapper for the Twitter API | twython
  - Tools for Data Mining & Analysis | scikit-learn
  - Network Modeling & Viz | networkx
  - Natural Language Toolkit | NLTK
- Coursework
  - Sentiment analysis, trending topics, and friendship mapping with Twitter API
  - Joins and Matrix Manipulation in MapReduce (AWS EC2)
  - In-database Text analysis (SQL)
  - Sentiment analysis of movie tweets (Python)
This degree is brought to you by: "THE INTERNET".
Information is more democratized^ now than at any point in history. Given a little initiative and interest, you can tailor and excel in an education of your own design. The connective web made me what I am today: I grew from a child obsessed with Number Munchers into an adult whose jaw drops over DBSCAN.
The most valuable resources I used were:
- Coursera
- Khan Academy
- Wolfram Alpha
- Wikipedia
- Quora
- Kindle .mobis (carrying textbooks is so 90s.)
- PopSci Read: The Signal and the Noise, Nate Silver
- Friends & Family (Impossible without their support! Special Thanks to N.S.)
^ given internet access - an issue near and dear to me.