Incorrect reward for blue agent reaching max_steps #10

john-cardiff · 2022-09-06T20:57:21Z

YAWNING-TITAN/yawning_titan/envs/generic/generic_env.py

Line 263 in 8c8a1af

if self.network_interface.reward_end_multiplier:

It should be the following.

            if self.network_interface.reward_end_multiplier:
                # FIXED
                reward = self.network_interface.reward_success * (
                    len(self.network_interface.get_nodes(filter_true_safe=True))
                    / self.network_interface.get_number_of_nodes()
                )
                # incorrect below
                # reward = self.network_interface.reward_end_multiplier * (
                #     len(self.network_interface.get_nodes(filter_true_safe=True))
                #     / self.network_interface.get_number_of_nodes()
                # )

since in the NetworkInterface class the variables are defined as following (although their names should likely be swapped).

    self.reward_success = self.reward_settings["rewards_for_reaching_max_steps"]
    self.reward_end_multiplier = self.reward_settings[
        "end_rewards_are_multiplied_by_end_state"
    ]

The text was updated successfully, but these errors were encountered:

jamesshort1 · 2022-10-31T13:51:50Z

Added as a new IDT Jira issue (AIDT-64)

…eps (GitHub Issue #10) Correct calculation for the reward multiplier. Was multiplying a boolean instead of the reward for reaching max steps

ChrisMcCarthyDev added the bug Something isn't working label Dec 2, 2022

ChrisMcCarthyDev self-assigned this Feb 23, 2023

ChrisMcCarthyDev added the fix_version:1.1.0 Fixed in version 1.1.0 label Feb 23, 2023

ChrisMcCarthyDev closed this as completed Jul 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incorrect reward for blue agent reaching max_steps #10

Incorrect reward for blue agent reaching max_steps #10

john-cardiff commented Sep 6, 2022

jamesshort1 commented Oct 31, 2022

Incorrect reward for blue agent reaching max_steps #10

Incorrect reward for blue agent reaching max_steps #10

Comments

john-cardiff commented Sep 6, 2022

jamesshort1 commented Oct 31, 2022