Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DDPG with default hyper-paras doesn't work in mujoco swimmer-v2 env #690

Open
SiyuanLee opened this issue Oct 30, 2018 · 5 comments
Open

Comments

@SiyuanLee
Copy link

Hi all, I have run DDPG with default hyperparameters in mujoco swimmer-v2 environment, but the reward converges to a very low value, only 4 or 5, so the swimmer cannot swim at all. I did not change the code, and run with the script: python -m baselines.run --alg=ddpg --env=Swimmer-v2 --num_timesteps=1e6 . I don't know where is wrong. Thank you for your help.

@iswaverly
Copy link

Have you ever try HalfCheetah? I got problem with it either. #764

@pzhokhov
Copy link
Collaborator

I suspect the issue is with hyperparameters (it would be uninteresting if the default ones worked in all environments, would it not ? ;) @iswaverly @SiyuanLee if you find the hyperparameter setting that works, please post it here.

@Yeosangho
Copy link

Yeosangho commented Jan 3, 2019

I have same issue. DDPG of baselines doesn't training or training very slowly in mujoco enviroments that I tested (Halfcheetah, Walker2d)

But I think, that is not caused by hyper parameter settings because, hyperparameter that I checked is same to original paper of DDPG. however, I checked that network structure is different from original paper.

@sayomakinwa
Copy link

I have a similar situation with DDPG on Humanoid-v2 environment; it doesn't converge. Suggestions will be highly appreciated

@QiXuanWang
Copy link

Actually I tried some other envs(MountainCarContinuous-v0, CartPole-v0, etc). None of them give me positive returns. While using another implementation, MountainCar give me positive return. Not sure if it's DDPG implementation issue or just hyper-parameter tuning issue...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants