Refactor mujoco envs to support dynamic arguments #1304

hartikainen · 2019-02-02T06:39:50Z

I'm glad to see that gym finally supports dynamic parameterization of the environments (#1301).

We've been using custom parameterizable versions of the basic mujoco environments in our softlearning codebase for a while now, and I thought I would share those environments here as well. The implementations are imo a bit more readable and they support parameterization of the environments via init args. For example, certain environments can be made to not terminate and all the rewards and costs can be tuned via init arguments. With the default parameters, these environments are exact copies of the original environments (I've verified the step and reset functions by testing them against the old implementations).

Currently, the only refactored environments are Ant, Hopper, HalfCheetah, Humanoid, Swimmer, and Walker2d. I'm happy to refactor the rest of the mujoco environments if these seem useful.

pzhokhov · 2019-02-09T00:29:46Z

this looks neat to me! I am a little hesitant about the exact reproduction of the original mujoco envs - could you add some unit tests to that extent? For instance, instead of replacing the old mujoco envs you could create the -v3 version of them, and add unit tests that compare v2 and v3. If all of them pass, we can create a separate PR removing the non-configurable versions.

hartikainen · 2019-02-09T01:35:40Z

@pzhokhov thanks for the comments. I moved the new environments under v3, and added tests cases to make sure that all the new environments match the corresponding old ones.

pzhokhov · 2019-02-16T02:41:16Z

@hartikainen there were some issues with the tests requiring mujoco; I have added respective skip modifiers and made a new PR: #1328. Unfortunately, I squash-merged your branch instead of merging it; which removed commit history. Feel free to either update your PR with the changes (the only changes are in gym/env/tests/spec_list.py and gym/envs/tests/test_mujoco_v2_to_v2_conversion.py

This reverts commit 4a3a74c.

This reverts commit 62b4bcf.

This reverts commit b2a439f.

hartikainen · 2019-02-16T05:13:36Z

@pzhokhov I merged your changes into this branch and at the same time rebased this one from master. Let me know if there's anything else that needs to change.

pzhokhov · 2019-02-25T23:12:11Z

Awesome, thanks! Merging

hartikainen · 2019-02-26T23:10:40Z

Nice, thanks @pzhokhov!

* Refactor gym envs to support dynamic arguments * Fix viewer setup lookat configuration * Add xml_file argument for mujoco envs * Move refactored mujoco envs to their own _v3.py files * Revert "Add xml_file argument for mujoco envs" This reverts commit 4a3a74c. * Revert "Fix viewer setup lookat configuration" This reverts commit 62b4bcf. * Revert "Refactor gym envs to support dynamic arguments" This reverts commit b2a439f. * Fix v3 SwimmerEnv info * Regiter v3 mujoco environments * Implement v2 to v3 conversion test * Add extra step info the v3 environments * polish the new unit tests a little bit

hartikainen changed the title ~~Refactor gym envs to support dynamic arguments~~ Refactor mujoco envs to support dynamic arguments Feb 2, 2019

hartikainen force-pushed the refactor/parameterizable-mujoco-envs branch from d78f1c6 to 671d82a Compare February 9, 2019 01:34

hartikainen and others added 12 commits February 15, 2019 21:07

Refactor gym envs to support dynamic arguments

42ceebd

Fix viewer setup lookat configuration

506719b

Add xml_file argument for mujoco envs

2806325

Move refactored mujoco envs to their own _v3.py files

5a1f132

Revert "Add xml_file argument for mujoco envs"

256b63d

This reverts commit 4a3a74c.

Revert "Fix viewer setup lookat configuration"

02b223d

This reverts commit 62b4bcf.

Revert "Refactor gym envs to support dynamic arguments"

3f90f9c

This reverts commit b2a439f.

Fix v3 SwimmerEnv info

e140e4b

Regiter v3 mujoco environments

2ee49ba

Implement v2 to v3 conversion test

4e241da

Add extra step info the v3 environments

901e005

polish the new unit tests a little bit

f84c1f3

hartikainen force-pushed the refactor/parameterizable-mujoco-envs branch from 671d82a to f84c1f3 Compare February 16, 2019 05:12

pzhokhov merged commit 90a0564 into openai:master Feb 25, 2019

hartikainen deleted the refactor/parameterizable-mujoco-envs branch February 26, 2019 23:10

hartikainen mentioned this pull request Mar 3, 2019

Update gym version rail-berkeley/softlearning#50

Merged

matthiasplappert mentioned this pull request Apr 6, 2019

Discrepancy in MuJoCo Humanoid environment #1380

Closed

sgillen mentioned this pull request Sep 11, 2021

Replacing gym's Mujoco envs with brax envs google/brax#49

Open

qgallouedec mentioned this pull request Feb 2, 2023

HumanoidStandup-v3, Reacher-v3 and Inverted[Double]Pendulum-v3 not found in gym registry DLR-RM/rl-baselines3-zoo#350

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor mujoco envs to support dynamic arguments #1304

Refactor mujoco envs to support dynamic arguments #1304

hartikainen commented Feb 2, 2019 •

edited

Loading

pzhokhov commented Feb 9, 2019

hartikainen commented Feb 9, 2019 •

edited

Loading

pzhokhov commented Feb 16, 2019 •

edited

Loading

hartikainen commented Feb 16, 2019 •

edited

Loading

pzhokhov commented Feb 25, 2019

hartikainen commented Feb 26, 2019

Refactor mujoco envs to support dynamic arguments #1304

Refactor mujoco envs to support dynamic arguments #1304

Conversation

hartikainen commented Feb 2, 2019 • edited Loading

pzhokhov commented Feb 9, 2019

hartikainen commented Feb 9, 2019 • edited Loading

pzhokhov commented Feb 16, 2019 • edited Loading

hartikainen commented Feb 16, 2019 • edited Loading

pzhokhov commented Feb 25, 2019

hartikainen commented Feb 26, 2019

hartikainen commented Feb 2, 2019 •

edited

Loading

hartikainen commented Feb 9, 2019 •

edited

Loading

pzhokhov commented Feb 16, 2019 •

edited

Loading

hartikainen commented Feb 16, 2019 •

edited

Loading