- Note this is a preliminary repository and *no effort* was spent getting REM to work well with TD3 (as it doesn't work).
- This repository should should not be used for benchmarking and only be used a starting point for MuJoCo experiments.
If you use this code, please cite the the paper. To launch batch experiments with RSEM
, use the file run_main.sh
. To generate data, use run_expert.sh
. RSEM
works somewhat but not well though.
Method is tested on MuJoCo continuous control tasks in OpenAI gym. Networks are trained using PyTorch 0.4 and Python 2.7.