Update hanabi to use shimmy[openspiel] instead of Hanabi Learning Env #933

elliottower · 2023-04-02T20:29:21Z

Description

This PR updates hanabi to use OpenSpiel's Hanabi implementation rather than the Hanabi Learning Environment. The OpenSpiel implementation is in C++ and was updated as recently as 4 weeks ago, whereas the Hanabi Learning Environment has not been updated since 2021. OpenSpiel has a flag to use the Hanabi Learning Environment, but we don't have it enabled as there's not much point. From what I can tell, the Hanabi Learning Environment is very limited, and this is a much more full-featured implementation. It is also listed as thoroughly tested in OpenSpiel's list of available games: https://openspiel.readthedocs.io/en/latest/games.html.

The previous hanabi implementation (v4) did things very differently from all of the other games in pettingzoo.classic, which makes sense as it was last updated in pettingzoo v1.8.0.

This PR also updates the API test and render test code to properly handle action masking in pettingzoo's classic games, where the mask is contained in obs['action_mask']. Previously, the code used numpy to randomly sample from the action space, but this led to errors if the action mask had zero valid actions. My code uses action_space(agent).sample(mask) instead, which works how we expect it to: it still generates a valid action in the space, which then causes the environment to terminate (as the action is illegal). I also did some other cleanups, mostly code style changes and accounting for the possibility of an action mask; the tests are still passing.
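The failure mode described above can be illustrated with a stdlib-only sketch. It does not call PettingZoo or Gymnasium: `sample_naive` and `sample_with_fallback` are hypothetical helpers, and the fallback to action 0 is an assumption chosen to mirror the behaviour described in this PR, where an all-zero mask still yields an action inside the space.

```python
import random


def sample_naive(mask, rng):
    """Pick uniformly among the actions whose mask entry is 1.

    Raises IndexError when the mask has no valid action, which is
    the kind of failure the old test code could run into.
    """
    valid = [action for action, allowed in enumerate(mask) if allowed]
    return rng.choice(valid)


def sample_with_fallback(mask, rng, fallback=0):
    """Pick a valid action if one exists, otherwise return `fallback`.

    The fallback (assumed here to be action 0) is still a member of the
    action space, so the environment can handle it, for example by
    terminating because the move is illegal.
    """
    valid = [action for action, allowed in enumerate(mask) if allowed]
    if not valid:
        return fallback
    return rng.choice(valid)


rng = random.Random(0)
some_valid = [0, 1, 0, 1]
none_valid = [0, 0, 0, 0]

assert sample_with_fallback(some_valid, rng) in (1, 3)
assert sample_with_fallback(none_valid, rng) == 0
```

In the PR itself this role is played by action_space(agent).sample(mask); the sketch only shows why sampling from an empty set of valid actions needs a defined fallback.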

Unfortunately, I was not able to get hanabi's seed test to work, which is bizarre because in Shimmy the OpenSpiel seed tests do work but the API test doesn't. I think the API test is much more important to get working, and I spent the majority of my time on that, but I can delay merging this and see whether the errors can be fixed. The seed tests in PettingZoo should probably be updated as well: they are very confusing to debug and totally unlike the tests used in gymnasium, or the ones we wrote in Shimmy to test pettingzoo environments (adapted from gymnasium's seed test code), which I think were much cleaner.

I think if I update the seed tests to match these other repos, they may end up passing for Hanabi, but I'm not sure exactly what the current seed tests do in comparison (I wouldn't want to change them and remove functionality), so maybe someone could help figure that out before I make those changes.
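For context, a seed test of the kind discussed here usually resets two copies of an environment with the same seed, feeds them the same actions, and checks that the observations match. The sketch below shows that idea with a stdlib-only stand-in; `ToyEnv` and `rollout` are hypothetical and are not the PettingZoo, Gymnasium, or Shimmy test code.

```python
import random


class ToyEnv:
    """Hypothetical stand-in environment whose randomness comes from a seed."""

    def reset(self, seed=None):
        self._rng = random.Random(seed)
        self._state = self._rng.randint(0, 9)
        return self._state

    def step(self, action):
        # The next observation depends on the action and on seeded noise.
        self._state = (self._state + action + self._rng.randint(0, 9)) % 10
        return self._state


def rollout(env, seed, actions):
    """Reset with `seed`, apply `actions`, and collect every observation."""
    observations = [env.reset(seed=seed)]
    for action in actions:
        observations.append(env.step(action))
    return observations


actions = [1, 2, 3, 4]
first = rollout(ToyEnv(), seed=42, actions=actions)
second = rollout(ToyEnv(), seed=42, actions=actions)

# Same seed and same actions must give identical trajectories.
assert first == second
```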

Fixes # (issue), Depends on # (pull request)

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Screenshots

Please attach before and after screenshots of the change if applicable.
To upload images to a PR -- simply drag and drop or copy paste.

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have run pytest -v and no errors are present.
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I solved any possible warnings that pytest -v has generated that are related to my code to the best of my knowledge.
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

elliottower · 2023-04-02T21:25:04Z

The render tests work with the most recent version of shimmy, where I enabled rendering for openspiel envs. Once that version is released, this will pass.

jjshoots

Some minor questions, but other than that fairly sane.

test/all_parameter_combs_test.py

pyproject.toml

pettingzoo/test/api_test.py

pettingzoo/classic/hanabi/hanabi.py

jjshoots

Some general comments, but otherwise LGTM.

pettingzoo/test/api_test.py

jjshoots · 2023-04-07T16:37:38Z

pettingzoo/test/api_test.py

+    if isinstance(observation_0, dict) and "observation" in observation_0:
+        observation_0 = observation_0["observation"]
+


This looks hacky, but I guess it works.

elliottower · 2023-04-12

Otherwise I would have had to change about 4 other calls that use observation_0 and expect it to be an obs rather than a dict. It is a bit unclear to read, but I think what it does is fine; without it, the test results in errors.
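The check under discussion can be read as a small normalisation step. A minimal sketch, where `unwrap_observation` is a hypothetical helper name (the PR inlines the check rather than defining a function):

```python
def unwrap_observation(obs):
    """Return the raw observation whether or not it is wrapped in a dict.

    Action-masked environments return a dict such as
    {"observation": ..., "action_mask": ...}; other environments
    return the observation directly.
    """
    if isinstance(obs, dict) and "observation" in obs:
        return obs["observation"]
    return obs


masked = {"observation": [0.1, 0.2], "action_mask": [1, 0]}
plain = [0.1, 0.2]

assert unwrap_observation(masked) == [0.1, 0.2]
assert unwrap_observation(plain) == [0.1, 0.2]
```

Normalising once near the top of the test keeps the later uses of observation_0 unchanged, which is the trade-off described in the reply above.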

jjshoots

Edit: once tests pass.

pettingzoo/test/api_test.py


pseudo-rnd-thoughts · 2023-04-12T09:16:07Z

Given the complexity of the PR, there seem to be several different PRs happening here: updating test_seeding, docs updates, and the hanabi change. Personally, I would split the PR up, which will make it simpler to solve the issue.

elliottower · 2023-04-12T14:05:50Z

> Given the complexity of the PR, there seem to be several different PRs happening here: updating test_seeding, docs updates, and the hanabi change. Personally, I would split the PR up, which will make it simpler to solve the issue.

Yeah, makes sense. I was thinking the same, and I believe I mentioned in the pz dev channel that it should probably be split unless we want to make an exception.

pseudo-rnd-thoughts · 2023-04-12T14:48:48Z

It is simpler 90% of the time to have several PRs if you want to fix several issues, unless you are changing a total of 20 lines or fewer.

elliottower · 2023-04-13T19:25:09Z

Closing in favor of separate PRs

elliottower added 2 commits April 2, 2023 16:16

Update hanabi to use shimmy[openspiel] instead of Hanabi Learning Env

8601497

Fix render test to use action_space.sample() for action mask

fb2fc0f

elliottower requested review from jjshoots and pseudo-rnd-thoughts April 2, 2023 20:32

Remove hanabi learning env req, shimmy as 'all' req

57d330c

jjshoots requested changes Apr 3, 2023

View reviewed changes

test/all_parameter_combs_test.py Outdated Show resolved Hide resolved

pyproject.toml Outdated Show resolved Hide resolved

pettingzoo/test/api_test.py Outdated Show resolved Hide resolved

pettingzoo/test/api_test.py Outdated Show resolved Hide resolved

jjshoots reviewed Apr 3, 2023

View reviewed changes

pettingzoo/classic/hanabi/hanabi.py Outdated Show resolved Hide resolved

elliottower and others added 3 commits April 4, 2023 15:45

Remove unnecessary check for info[action_mask], fix old citation

3c3971b

Fix typos and clean up code from PR feedback

21f55ab

Merge branch 'master' into hanabi-shimmy

b3fe068

jjshoots approved these changes Apr 7, 2023

View reviewed changes

jjshoots requested changes Apr 7, 2023

View reviewed changes

elliottower and others added 3 commits April 7, 2023 12:48

Update api_test.py

07009c8

Temporary fix for rendering not implemented error

4ebc2b6

Pre-commit

91b880e

jjshoots requested changes Apr 7, 2023

View reviewed changes

pettingzoo/test/api_test.py Outdated Show resolved Hide resolved

elliottower added 8 commits April 7, 2023 14:29

Clean up action masking code per jet's suggestions

9141084

fix typo in observation dict handling, fix dtype mismatch error

a7e037f

Pre-commit

f043896

Re-write seed test to match gymnasium/shimmy seed tests

1512ced

Used same function names and functionality as old seed_test

b968529

Fixed seed test call for hanabi (unsupported)

c0fa964

Attempt to fix generated_agents variable env tests

8785eb6

Fix typos and add temp code to debug issue

b368d31

jjshoots mentioned this pull request Apr 11, 2023

Fix seed test #944

Closed

elliottower and others added 4 commits April 11, 2023 13:57

Fix variable env test and add obs seeding to check_env_deterministic

b1c17a2

Merge branch 'Farama-Foundation:master' into hanabi-shimmy

e55cd71

Remove teporary fix in seed test code

b18a16f

Disable failing waterworld tests, add comment explaining bug

ae13b4e

elliottower and others added 3 commits April 11, 2023 21:00

Merge branch 'hanabi-shimmy' of github.com:elliottower/PettingZoo int…

6145de3

…o hanabi-shimmy

Update api_test.py

c681345

Fix pre-commit and disable waterworld seed tests

245e2a9

This was referenced Apr 13, 2023

Update testing #946

Merged

Wrappers sisl docs update #947

Merged

Hanabi update #948

Merged

elliottower closed this Apr 13, 2023
