-
-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Proposal] Mujoco-v5 #91
Comments
Can you elaborate more on Proposal 1 and how would this work. I don't know if you mean to register |
I mean adding them to, to the suite of available environments. |
Im unconvinced by the test showing this to be an issue. If we make this change, we will need to train agents to ensure there is no performance regression is a result of the change. It is a shame we can't do v4.1 as the environment number |
No, the test is indeed not good enough to prove the point (and it does fail after the fix to the humanoid) The analysis done in the issues proves it I have done testing for millions of steps, with them being zero (it is in the other issues) |
Could this include new compiled versions of all of the models so they have the same format |
I would rather not, I am not sure if I can assert identical behavior because they add |
@pseudo-rnd-thoughts Reason as to why I do not trust the
Notice that for the first 20k steps |
Very interesting, thanks for doing that |
Can you please provide us with the complete model for each of the three cases? |
Hey @saran-t, I am currently doing more testing, |
Please try the auto-converted XML in https://colab.research.google.com/drive/1slY_8RlzzRffDQhLt3uqazyD_OU9pd2r#scrollTo=lH94cPjK4SKZ and let me know how that works. |
@saran-t you can use a service like https://www.diffchecker.com/text-compare/ to compare them |
That is literally the whole point? The only difference that you're seeing is just the tail end of the precision. You can see at the bottom of the Colab that the auto-converted XMLs have bit-identical results. To be clear, what I'm saying is you should just replace your existing XML with https://colab.research.google.com/drive/1slY_8RlzzRffDQhLt3uqazyD_OU9pd2r#scrollTo=s4DDElgo4_d6 (which is the absolute minimal diff that results in absolutely no change whatsoever under MuJoCo 2.3.3). |
@saran-t k, I will test it I am under the impression that the only change was printing my numbers at full prec >>> numpy.array(-0.19999999999999996, dtype=numpy.float32) == numpy.array(-0.2, dtype=numpy.float32)
True i will test your manually made xml |
Your problem there is |
Notice that for the first 20k steps |
Not really sure how that can be the case since I've verified that the model is literally identical (bit perfect) to the v4 XML... |
I am double-checking, this weirds me out too. |
I re-run the benchmark (I had (The other models have different behavior, I am not showing them)
questions 2-4 are there to know if we can change the model for environment versions |
When this is done, I think we should write a blog post as this will be a better way of publishing the changes and for users to cite the changes compared to a PR. |
Sure, where would the blog post be published? (farama.org?) In the meantime the active changelog is here: #104 |
Proposal 0
After the
Gymnasium/envs/mujoco
gets moved here, create a new revision (v5), that fixes the 0 const observations in the following domains:Humanoid (+standup): Farama-Foundation/Gymnasium#204, Farama-Foundation/Gymnasium#504
Ant: Farama-Foundation/Gymnasium#204, Farama-Foundation/Gymnasium#214
Reacher: Farama-Foundation/Gymnasium#220
InvertedDoublePendulum: Farama-Foundation/Gymnasium#228, Farama-Foundation/Gymnasium#500
InvertedPendulum: Farama-Foundation/Gymnasium#500
Note: I have written a quick test, that only fails in the above environments
Proposal 1
Register some new environments to the suite from
mamujoco
like coupled half cheetah https://github.com/Kallinteris-Andreas/Gymnasium-Robotics-Kalli/blob/main/gymnasium_robotics/envs/multiagent_mujoco/coupled_half_cheetah.py#L27Proposal 2
Update
Hopper
andWalker2D
models to not requirecoordinate="Global"
: google-deepmind/mujoco#833And fix Walker2D feet Farama-Foundation/Gymnasium#477
Checklist
The text was updated successfully, but these errors were encountered: