I have checked that there is no similar issue in the repo (required)
Possible Solution
Stable Baselines' ReplayBuffer currently does not support the new format returned by gym.Env.step. Gym's step API changed from obs, rew, done, info = env.step(action) to obs, rew, terminated, truncated, info = env.step(action).
We would need to implement a slightly modified version of the ReplayBuffer in CleanRL itself. Other than this, the changes required are minimal.
I can submit an initial PR with changes required for SAC if you're interested.
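To make the proposed buffer change concrete, here is a minimal sketch of a replay buffer that stores terminated and truncated separately. It is not the Stable Baselines or CleanRL implementation: the names Transition, SimpleReplayBuffer and td_target are hypothetical, and the choice to keep bootstrapping on truncation is an assumption about how the two flags would be used.

```python
import random
from collections import deque
from typing import Any, Deque, List, NamedTuple


class Transition(NamedTuple):
    obs: Any
    next_obs: Any
    action: Any
    reward: float
    terminated: bool  # episode ended in a true terminal state
    truncated: bool   # episode was cut off, e.g. by a time limit


class SimpleReplayBuffer:
    """Hypothetical minimal buffer for the 5-tuple step API (illustration only)."""

    def __init__(self, capacity: int, seed: int = 0) -> None:
        # deque with maxlen drops the oldest transition once capacity is reached
        self._storage: Deque[Transition] = deque(maxlen=capacity)
        self._rng = random.Random(seed)

    def add(self, obs, next_obs, action, reward, terminated, truncated) -> None:
        self._storage.append(
            Transition(obs, next_obs, action, float(reward),
                       bool(terminated), bool(truncated))
        )

    def sample(self, batch_size: int) -> List[Transition]:
        # Uniform sampling without replacement
        return self._rng.sample(list(self._storage), batch_size)

    def __len__(self) -> int:
        return len(self._storage)


def td_target(t: Transition, next_value: float, gamma: float = 0.99) -> float:
    # Assumption: only true termination stops bootstrapping;
    # a truncated episode still bootstraps from next_value.
    not_terminated = 0.0 if t.terminated else 1.0
    return t.reward + gamma * not_terminated * next_value
```

In a training loop, the five values returned by env.step(action) would be passed straight to add, and the target computation would read terminated rather than a combined done flag.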
Update on the ticket: the current gym master is set to release 0.26.0, which enables the obs, rew, terminated, truncated, info = env.step(action) API by default.
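While both step formats are in circulation, a small shim can normalize either one to the five-value form. The sketch below is illustrative only: normalize_step is a hypothetical helper, not part of gym, and it assumes that old-style time-limit truncation is reported through the "TimeLimit.truncated" key in info.

```python
from typing import Any, Dict, Tuple


def normalize_step(result: tuple) -> Tuple[Any, float, bool, bool, Dict]:
    """Return (obs, rew, terminated, truncated, info) for either step format."""
    if len(result) == 5:
        # New API: already split into terminated and truncated
        obs, rew, terminated, truncated, info = result
        return obs, rew, bool(terminated), bool(truncated), info
    if len(result) == 4:
        # Old API: a single done flag; assume truncation is flagged in info
        obs, rew, done, info = result
        truncated = bool(info.get("TimeLimit.truncated", False))
        terminated = bool(done) and not truncated
        return obs, rew, terminated, truncated, info
    raise ValueError(f"unexpected step result of length {len(result)}")
```

Usage would look like obs, rew, terminated, truncated, info = normalize_step(env.step(action)), with an episode ending when terminated or truncated is true.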
vwxyzjn changed the title from "Upgrade gym version to 0.25.1" to "Upgrade gym version to 0.26.1" on Oct 3, 2022
Problem Description
Upgrade gym version used in cleanrl from 0.23.1 to 0.25.1
Checklist
poetry install (see CleanRL's installation guideline).