MAP Policy Optimisation (MPO)

Inspired by implementation of daisatojp. Basic framework and SAC implementation are mostly taken from OpenAI SpinningUp

References:

MPO: link SAC: link RERPI: link

Installation

Python 3.9+ and working MuJoCo installation are required. Optional: Create conda environment with

conda create -n myenv python=3.9

Standard installation with pip

git clone https://github.com/freiberg-roman/mpo.git
cd mpo
pip install -e ".[dev]"

Test installation by running

python -m mpo.examples.main algorithm=mpo q_learning=retrace overrides=pendulum

Name		Name	Last commit message	Last commit date
Latest commit History 133 Commits
mpo		mpo
requirements		requirements
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
pyproyect.toml		pyproyect.toml
setup.cfg		setup.cfg
setup.py		setup.py