DDPG doesn't work #855

DennisWangCW · 2019-03-17T12:24:24Z

I downloaded the baselines code, installed it using pip install -e ., and ran DDPG using the proposed command line. I've tested it on several mujoco environments but none of them converged at all. I didn't edit the code and want to know what's wrong with it? Hope for reply, thanks.

DanielTakeshi · 2019-03-25T15:30:13Z

It's helpful if you can provide more information and the log files as well. For example, see the issue I filed a while back about DQN:

#431

QiXuanWang · 2019-05-06T06:18:49Z

I didn't run Mujoko but tried some other popular envs (MountainCar for example), which should converge since it's so simple. But default training with 1e6 steps give me big negative returns. Not sure what to tune... While some other implementation gave me positive return though probably the calculation method is slightly different.

DanielTakeshi mentioned this issue Jun 20, 2019

DDPG implementation fails to learn well on at least five MuJoCo-v2 envs for all three noise types. I report steps to reproduce and learning curve plots [and show that PPO2 seems to work fine]. #938

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DDPG doesn't work #855

DDPG doesn't work #855

DennisWangCW commented Mar 17, 2019

DanielTakeshi commented Mar 25, 2019

QiXuanWang commented May 6, 2019

DDPG doesn't work #855

DDPG doesn't work #855

Comments

DennisWangCW commented Mar 17, 2019

DanielTakeshi commented Mar 25, 2019

QiXuanWang commented May 6, 2019