-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DDPG with default hyper-paras doesn't work in mujoco swimmer-v2 env #690
Comments
Have you ever try HalfCheetah? I got problem with it either. #764 |
I suspect the issue is with hyperparameters (it would be uninteresting if the default ones worked in all environments, would it not ? ;) @iswaverly @SiyuanLee if you find the hyperparameter setting that works, please post it here. |
I have same issue. DDPG of baselines doesn't training or training very slowly in mujoco enviroments that I tested (Halfcheetah, Walker2d) But I think, that is not caused by hyper parameter settings because, hyperparameter that I checked is same to original paper of DDPG. however, I checked that network structure is different from original paper. |
I have a similar situation with DDPG on Humanoid-v2 environment; it doesn't converge. Suggestions will be highly appreciated |
Actually I tried some other envs(MountainCarContinuous-v0, CartPole-v0, etc). None of them give me positive returns. While using another implementation, MountainCar give me positive return. Not sure if it's DDPG implementation issue or just hyper-parameter tuning issue... |
Hi all, I have run DDPG with default hyperparameters in mujoco swimmer-v2 environment, but the reward converges to a very low value, only 4 or 5, so the swimmer cannot swim at all. I did not change the code, and run with the script: python -m baselines.run --alg=ddpg --env=Swimmer-v2 --num_timesteps=1e6 . I don't know where is wrong. Thank you for your help.
The text was updated successfully, but these errors were encountered: