Skip to content

Using Open AI spinning up environment for MPO implementation

Notifications You must be signed in to change notification settings

freiberg-roman/mpo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MAP Policy Optimisation (MPO)

Inspired by implementation of daisatojp. Basic framework and SAC implementation are mostly taken from OpenAI SpinningUp

References:

MPO: link SAC: link RERPI: link

Installation

Python 3.9+ and working MuJoCo installation are required. Optional: Create conda environment with

conda create -n myenv python=3.9

Standard installation with pip

git clone https://github.com/freiberg-roman/mpo.git
cd mpo
pip install -e ".[dev]"

Test installation by running

python -m mpo.examples.main algorithm=mpo q_learning=retrace overrides=pendulum

About

Using Open AI spinning up environment for MPO implementation

Resources

Stars

Watchers

Forks

Languages