This repo holds project reports and my resume for my data science work.
- Resume holds the latest copy of my resume in PDF format
- Reports holds project reports for my academic work at Northwestern University.
Here's a brief summary of the projects:
- Wins per game predictive modeling for over 100 years of baseball data using regression and decision trees using glm, caret, broom.
- Wine case purchase volume predictive modeling using poisson, negative binomial, and hurdle models using glm, caret, broom
- Autocrashes predictive modeling using ordinary least squares (OLS), logistic regression
- Discrete choice experiment modeling using Hierarchical Bayes Multinomial Logit to select product design
- Model to predict target market for campaign using random forests and naïve bayes models
- Customer segmentation modeling using hierarchical agglomerative clustering and k-means followed by market segmentation profiling
- Time series forecasting for the DengAI, disease spread competition, models using xts, forecast, and neural networks
- Model to predict donation amounts for a not-for-profit marketing campaign using a variety of machine learning modeling approaches including boosting, bagging, random forest, and PCR using caret