Add sticky actions for Atari games #1286

qgallouedec · 2023-01-18T16:37:52Z

Description

Whether this default setting will change in the future is still an open question.

This is the implementation that uses the environment argument repeat_action_probability. But I find it confusing: why would repeat_action_probability be an argument to make_atari_env and not to AtariWrapper? I am open to change this implementation, but it requires adding a new wrapper RepeatActionWrapper.

Motivation and Context

I have raised an issue to propose this change (required for new features and bug fixes)

closes #271

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist

Note: You can run most of the checks using make commit-checks.

Note: we are using a maximum length of 127 characters per line

stable_baselines3/common/atari_wrappers.py

qgallouedec · 2023-01-20T14:19:24Z

I finally opted for the natural way from a user point of view.

The implementation is strictly equivalent to that of ALE, namely:

On reset, the previous action is initialized to NOOP:

https://github.com/mgbellemare/Arcade-Learning-Environment/blob/49154a3e96f0858e5e9a1f41c89cfe18e6741468/src/environment/stella_environment.cpp#L194

On step, the previous action is repeated with a probability of action_repeat_probability:

https://github.com/mgbellemare/Arcade-Learning-Environment/blob/49154a3e96f0858e5e9a1f41c89cfe18e6741468/src/environment/stella_environment.cpp#L166

qgallouedec · 2023-01-20T14:45:01Z

Last thing to clarify: should I put the new arg at the end of the arg list?

araffin · 2023-01-23T16:05:14Z

Last thing to clarify: should I put the new arg at the end of the arg list?

I would say so, to avoid any bad surprises.

araffin · 2023-01-23T16:08:10Z

stable_baselines3/common/atari_wrappers.py

+        if action_repeat_probability > 0.0:
+            env = StickyActionEnv(env, action_repeat_probability)
+        if noop_max > 0:
+            env = NoopResetEnv(env, noop_max=noop_max)
        env = MaxAndSkipEnv(env, skip=frame_skip)


maybe skip that one if frame_skip<=1 (need to check if it is <=0 or <=1, but if I recall, it should have been called action repeat.

stable_baselines3/common/atari_wrappers.py

araffin

apart from minor comment, LGTM =)

qgallouedec and others added 8 commits January 18, 2023 16:47

repeat_action_probability

d702f42

Add test

ffa0818

Undo atari wrapper doc change since CI fails

8668077

remove action_repeat_probability from make_atari_env

a87cb7d

Add sticky action wrapper and improve documentation

3592a34

Update changelog

47e4fb6

handle the case noop_max=0

970c753

Update tests

8fc77a0

araffin reviewed Jan 20, 2023

View reviewed changes

stable_baselines3/common/atari_wrappers.py Show resolved Hide resolved

Comply to ALE implementation

c588a37

qgallouedec added 2 commits January 20, 2023 15:23

Reorder doc

7a14fec

Add doc warning and don't wrap with sticky action when not needed

b287bb4

qgallouedec marked this pull request as ready for review January 20, 2023 14:45

qgallouedec and others added 2 commits January 20, 2023 15:59

fix docstring and reorder

35d4eb2

Merge branch 'master' into feat/repeat_action_probability

c4bd1cb

Merge branch 'master' into feat/repeat_action_probability

bf953a7

araffin reviewed Jan 23, 2023

View reviewed changes

stable_baselines3/common/atari_wrappers.py Show resolved Hide resolved

Move action_repeat_probability args at the last position

819d7db

araffin reviewed Jan 25, 2023

View reviewed changes

stable_baselines3/common/atari_wrappers.py Show resolved Hide resolved

araffin reviewed Jan 25, 2023

View reviewed changes

Add ref

5351c0e

araffin changed the title ~~Add repeat_action_probability arg to make_atari_env~~ Add sticky actions for Atari games Jan 25, 2023

araffin added 2 commits January 26, 2023 00:16

Update doc and wrap with frameskip only if needed

f504eb6

Update changelog

50192df

araffin approved these changes Jan 25, 2023

View reviewed changes

Merge branch 'master' into feat/repeat_action_probability

b4a1a1b

qgallouedec merged commit 5ee9009 into master Jan 26, 2023

qgallouedec deleted the feat/repeat_action_probability branch January 26, 2023 09:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sticky actions for Atari games #1286

Add sticky actions for Atari games #1286

qgallouedec commented Jan 18, 2023 •

edited

Loading

qgallouedec commented Jan 20, 2023

qgallouedec commented Jan 20, 2023

araffin commented Jan 23, 2023

araffin Jan 23, 2023

araffin left a comment

Add sticky actions for Atari games #1286

Add sticky actions for Atari games #1286

Conversation

qgallouedec commented Jan 18, 2023 • edited Loading

Description

Motivation and Context

Types of changes

Checklist

qgallouedec commented Jan 20, 2023

qgallouedec commented Jan 20, 2023

araffin commented Jan 23, 2023

araffin Jan 23, 2023

Choose a reason for hiding this comment

araffin left a comment

Choose a reason for hiding this comment

qgallouedec commented Jan 18, 2023 •

edited

Loading