Update sb3_kaz_vector.py #1132

Fernadoo · 2023-11-14T13:50:40Z

Description

In the SB3 tutorial for the KAZ environment, the inner loop that updates the reward dict reuses the loop variable of the outer env.agent_iter() loop. After the inner loop finishes, that variable always refers to the last agent, knight_1, so knight_1 is always treated as the decision-maker. I have renamed the inner loop variable and it now works fine.
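To illustrate the shadowing issue, here is a minimal sketch that does not depend on PettingZoo or SB3. The agent list, the `agent_iter` stand-in, the placeholder reward, and the function names `run_buggy` and `run_fixed` are all assumptions made for illustration; this is not the tutorial's actual code.

```python
from itertools import cycle, islice

# Hypothetical agent names, loosely modelled on KAZ; not read from a real env.
AGENTS = ["archer_0", "archer_1", "knight_0", "knight_1"]


def agent_iter(n_turns):
    """Stand-in for env.agent_iter(): yields the agent whose turn it is."""
    return islice(cycle(AGENTS), n_turns)


def run_buggy(n_turns=4):
    rewards = {a: 0 for a in AGENTS}
    deciders = []
    for agent in agent_iter(n_turns):
        # Bug: the inner loop reuses the name `agent`, so once it finishes,
        # `agent` is the last entry of AGENTS rather than the acting agent.
        for agent in AGENTS:
            rewards[agent] += 1  # placeholder reward update
        deciders.append(agent)  # records who would pick the action
    return deciders


def run_fixed(n_turns=4):
    rewards = {a: 0 for a in AGENTS}
    deciders = []
    for agent in agent_iter(n_turns):
        # Fix: a distinct name leaves the outer `agent` untouched.
        for a in AGENTS:
            rewards[a] += 1  # placeholder reward update
        deciders.append(agent)
    return deciders


print(run_buggy())  # ['knight_1', 'knight_1', 'knight_1', 'knight_1']
print(run_fixed())  # ['archer_0', 'archer_1', 'knight_0', 'knight_1']
```

In the buggy version every turn is attributed to knight_1, which matches the behaviour described above. In the fixed version each agent acts on its own turn.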

Type of change

Bug fix (non-breaking change which fixes an issue)

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have run pytest -v and no errors are present.
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I solved any possible warnings that pytest -v has generated that are related to my code to the best of my knowledge.
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Fixed a bug in agent iteration

elliottower · 2023-11-14T17:29:31Z

Thanks for finding this. It makes sense, because I remember testing it locally and it seemed like it wasn't learning for the other agents. I figured that was due to the number of steps, but this makes way more sense. Any chance you could check whether any of the other SB3 tutorials have issues? I mainly adapted the old SB3 tutorial that Jordan (original creator of PZ) wrote in a Medium article, but I may not have done everything correctly. A bunch of people have been asking questions about it, and I haven't had time to look deeply myself; I'm not an expert when it comes to training algorithms and such.

tutorials/SB3/kaz/sb3_kaz_vector.py

elliottower · 2023-11-14T17:34:06Z

Looks like the other tutorials have the same error, but I can just fix it myself.

Fernadoo · 2023-11-15T02:39:01Z

> Thanks for finding this. It makes sense, because I remember testing it locally and it seemed like it wasn't learning for the other agents. I figured that was due to the number of steps, but this makes way more sense. Any chance you could check whether any of the other SB3 tutorials have issues? I mainly adapted the old SB3 tutorial that Jordan (original creator of PZ) wrote in a Medium article, but I may not have done everything correctly. A bunch of people have been asking questions about it, and I haven't had time to look deeply myself; I'm not an expert when it comes to training algorithms and such.

Sure thing, I'll try to help debug those tutorials once I have some spare time 🥸

elliottower · 2023-11-15T20:46:07Z

Closing this, as I'm just going to fix the pre-commit and do the other tutorials as well. I've branched from your branch, so you'll still get authorship credit.

Fernadoo added 2 commits November 14, 2023 21:33

Update sb3_kaz_vector.py

8d88172

Fixed a bug in agent iteration

Update sb3_kaz_vector.py

8fa57a7

elliottower reviewed Nov 14, 2023

tutorials/SB3/kaz/sb3_kaz_vector.py

elliottower approved these changes Nov 14, 2023

Fernadoo requested a review from elliottower November 15, 2023 00:56

elliottower mentioned this pull request Nov 15, 2023

Fix sb3 tutorials typo #1133

Merged

7 tasks

elliottower closed this Nov 15, 2023
