-
Notifications
You must be signed in to change notification settings - Fork 689
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TD3: fixed dimension of clipped_noise for target actions, added noise … #281
Conversation
…scaling based on action_scale
The latest updates on your projects. Learn more about Vercel for Git ↗︎
|
@vwxyzjn td3 continuous JAX variant will probably be affected too though. |
@joaogui1 could you check if TD3 (JAX) will be affected? |
@vwxyzjn check whether the jax implementation affected by this PR or if it will also need a fix like this PR? |
@joaogui1, the latter :) |
Got it, it will need to be fix, creating the PR this moment |
@dosssman thank you for the PR! Would you mind running some benchmark experiments to see if this change has a significant impact on the performance? If not, we don't even have to update the docs, since the main purpose of re-doing benchmark is to ensure no regression in performance. Lines 3 to 7 in f0bbf49
|
There is a performance regression on the Walker2d env, but the others are only marginally affected: Report here |
Given that this is a performance-impacting change, I am re-running the benchmark now. |
No material change to the performance (there is a minor regression in |
Description
Closes #279.
Types of changes
Checklist:
pre-commit run --all-files
passes (required).[ ] I have updated the documentation and previewed the changes viamkdocs serve
.If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.
--capture-video
flag toggled on (required).mkdocs serve
.width=500
andheight=300
).