PyMC3 Vs PyStan Comparison

Jonathan Sedar Personal Project

PyMC3 Vs PyStan Comparison

Spring 2016

This set of Notebooks and scripts comprise the pymc3_vs_pystan personal project by Jonathan Sedar of Applied AI Ltd, written primarily for presentation at the PyData London 2016 Conference.

The project demonstrates hierarchical linear regression using two Bayesian inference frameworks: PyMC3 and PyStan. The project borrows heavily from code written for Applied AI Ltd and is supplied here for educational purposes only. No copyright or license is extended to users.

Copyright Applied AI Ltd 2016

Development

Git clone the repo to your workspace.

e.g. in Mac OSX terminal:

    $> git clone https://github.com/jonsedar/pymc3_vs_pystan.git
    $> cd pymc3_vs_pystan

NOTES:

This project uses Python 3.5, and was developed on a Macbook Pro OSX 10.10.5 using the Anaconda distro with a new virtualenv on 19 April 2016
The project requires PyMC3 (with an associated Theano install) and PyStan (with an associated Stan install) so is quite heavy.
Specific versions of key packages for clarity: pymc3-3.0, theano-0.8.1, pystan-2.9.0.0

Setup a virtual environment for Python libraries

Using create a new virtualenv, installing packages from env YAML file:

 $> conda env create --file conda_env_pymc3_vs_pystan.yml
 $> source activate pymc3_vs_pystan

install remaining packages via pip (inc pymc3 master with deps):
```
 $> ./pip_install.sh
```
Launch Jupyter Notebook server
```
 $> jupyter notebook
```

Data

Local data is not stored in the repo, and should be manually copied into the subdirectory data/

See file data/README_DATA.md for more info

General Notes:

File hack_findmap.py contains a customised find_MAP() function to correct for pymc3's default behaviour of computing gradients when the chosen optimizer doesn't use them. We don't want to compute gradients in large datasets because it's quite computationally expensive.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
assets		assets
data		data
models		models
traces		traces
.env		.env
.gitignore		.gitignore
00_Overview.ipynb		00_Overview.ipynb
01_BasicInstallTests.ipynb		01_BasicInstallTests.ipynb
10_DataEngineering.ipynb		10_DataEngineering.ipynb
20_InitialDataAnalysis.ipynb		20_InitialDataAnalysis.ipynb
30_BasicLinearRegression.ipynb		30_BasicLinearRegression.ipynb
31_BasicModelEvaluation.ipynb		31_BasicModelEvaluation.ipynb
32_MoreModelEvaluation_PyMC3.ipynb		32_MoreModelEvaluation_PyMC3.ipynb
40_HierarchicalLinearRegression.ipynb		40_HierarchicalLinearRegression.ipynb
80_PoissonRegression.ipynb		80_PoissonRegression.ipynb
81_PoissonRegressionSneezes.ipynb		81_PoissonRegressionSneezes.ipynb
README.md		README.md
conda_env_pymc3_vs_pystan.yml		conda_env_pymc3_vs_pystan.yml
convenience_functions.py		convenience_functions.py
daft_plots.py		daft_plots.py
hack_findmap.py		hack_findmap.py
pip_install.sh		pip_install.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Jonathan Sedar Personal Project

PyMC3 Vs PyStan Comparison

Development

Git clone the repo to your workspace.

Setup a virtual environment for Python libraries

Data

General Notes:

About

Releases

Packages

Contributors 2

Languages

jonsedar/pymc3_vs_pystan

Folders and files

Latest commit

History

Repository files navigation

Jonathan Sedar Personal Project

PyMC3 Vs PyStan Comparison

Development

Git clone the repo to your workspace.

Setup a virtual environment for Python libraries

Data

General Notes:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages