DDPG with default hyper-paras doesn't work in mujoco swimmer-v2 env #690

SiyuanLee · 2018-10-30T10:33:45Z

Hi all, I ran DDPG with the default hyperparameters in the MuJoCo Swimmer-v2 environment, but the reward converges to a very low value, only 4 or 5, so the swimmer cannot swim at all. I did not change the code; the command I ran was: python -m baselines.run --alg=ddpg --env=Swimmer-v2 --num_timesteps=1e6. I don't know what is wrong. Thank you for your help.
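To check whether the low return is consistent rather than a one-off, the reported command can be repeated over a few seeds. The following is only a sketch: the helper name `build_ddpg_command` is hypothetical, and the `--seed` flag is assumed to be accepted by `baselines.run` (it is not part of the command reported above).

```python
def build_ddpg_command(env_id="Swimmer-v2", num_timesteps="1e6", seed=None):
    """Return the reported baselines command as an argument list.

    `seed` is optional; `--seed` is an assumption about baselines.run
    and is not part of the command reported in this issue.
    """
    cmd = [
        "python", "-m", "baselines.run",
        "--alg=ddpg",
        f"--env={env_id}",
        f"--num_timesteps={num_timesteps}",
    ]
    if seed is not None:
        cmd.append(f"--seed={seed}")
    return cmd


if __name__ == "__main__":
    # Print one command per seed; pass each list to subprocess.run(...)
    # to actually launch the runs (requires baselines and MuJoCo).
    for seed in (0, 1, 2):
        print(" ".join(build_ddpg_command(seed=seed)))
```

With no seed given, the helper reproduces exactly the command from the report.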

iswaverly · 2018-12-16T03:48:28Z

Have you tried HalfCheetah? I have a problem with it too; see #764.

pzhokhov · 2018-12-21T21:02:10Z

I suspect the issue is with the hyperparameters (it would be uninteresting if the default ones worked in all environments, would it not? ;)). @iswaverly @SiyuanLee, if you find a hyperparameter setting that works, please post it here.

Yeosangho · 2019-01-03T12:57:29Z

I have the same issue. The baselines DDPG does not train, or trains very slowly, in the MuJoCo environments I tested (HalfCheetah, Walker2d).

However, I don't think this is caused by the hyperparameter settings, because the hyperparameters I checked are the same as in the original DDPG paper. I did find, though, that the network structure differs from the original paper.
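A comparison like this can be made explicit with a small diff against the paper's reported settings. The sketch below is not taken from baselines: the reference values are the low-dimensional settings as recalled from the experiment details of the original DDPG paper (Lillicrap et al.) and should be verified against the paper, and `diff_settings` and the candidate dictionary are hypothetical names for illustration.

```python
# Reference settings as recalled from the original DDPG paper
# (low-dimensional state inputs). Verify against the paper before relying on them.
ORIGINAL_DDPG_PAPER = {
    "actor_lr": 1e-4,
    "critic_lr": 1e-3,
    "critic_l2_weight_decay": 1e-2,
    "gamma": 0.99,
    "tau": 0.001,                # soft target update rate
    "hidden_layers": (400, 300),
    "batch_size": 64,
    "replay_buffer_size": int(1e6),
    "ou_theta": 0.15,            # Ornstein-Uhlenbeck noise parameters
    "ou_sigma": 0.2,
}


def diff_settings(reference, candidate):
    """Return {key: (reference_value, candidate_value)} for every key
    whose value differs or is missing from `candidate`."""
    return {
        key: (ref_value, candidate.get(key))
        for key, ref_value in reference.items()
        if candidate.get(key) != ref_value
    }


if __name__ == "__main__":
    # Hypothetical candidate: same hyperparameters, smaller network.
    # Replace with the values read from the implementation under test.
    candidate = dict(ORIGINAL_DDPG_PAPER, hidden_layers=(64, 64))
    for key, (ref, got) in diff_settings(ORIGINAL_DDPG_PAPER, candidate).items():
        print(f"{key}: paper={ref} candidate={got}")
```

Filling the candidate dictionary from the implementation's actual defaults shows at a glance whether only the network structure differs or whether other settings diverge as well.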

sayomakinwa · 2019-02-25T22:01:28Z

I have a similar situation with DDPG on the Humanoid-v2 environment; it doesn't converge. Suggestions would be highly appreciated.

QiXuanWang · 2019-05-06T05:44:51Z

Actually, I tried some other envs (MountainCarContinuous-v0, CartPole-v0, etc.). None of them gave me positive returns, whereas with another implementation MountainCar gave me a positive return. I'm not sure whether it's a DDPG implementation issue or just a hyperparameter tuning issue...

DanielTakeshi mentioned this issue on Jun 20, 2019:

DDPG implementation fails to learn well on at least five MuJoCo-v2 envs for all three noise types. I report steps to reproduce and learning curve plots [and show that PPO2 seems to work fine]. #938 (Open)
