We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
YAWNING-TITAN/yawning_titan/envs/generic/generic_env.py
Line 263 in 8c8a1af
It should be the following.
if self.network_interface.reward_end_multiplier: # FIXED reward = self.network_interface.reward_success * ( len(self.network_interface.get_nodes(filter_true_safe=True)) / self.network_interface.get_number_of_nodes() ) # incorrect below # reward = self.network_interface.reward_end_multiplier * ( # len(self.network_interface.get_nodes(filter_true_safe=True)) # / self.network_interface.get_number_of_nodes() # )
since in the NetworkInterface class the variables are defined as following (although their names should likely be swapped).
self.reward_success = self.reward_settings["rewards_for_reaching_max_steps"] self.reward_end_multiplier = self.reward_settings[ "end_rewards_are_multiplied_by_end_state" ]
The text was updated successfully, but these errors were encountered:
Added as a new IDT Jira issue (AIDT-64)
Sorry, something went wrong.
Merged PR 49: AIDT-64 Incorrect reward for blue agent reaching max_st…
0dfb749
…eps (GitHub Issue #10) Correct calculation for the reward multiplier. Was multiplying a boolean instead of the reward for reaching max steps
3f70276
ChrisMcCarthyDev
No branches or pull requests
YAWNING-TITAN/yawning_titan/envs/generic/generic_env.py
Line 263 in 8c8a1af
It should be the following.
since in the NetworkInterface class the variables are defined as following (although their names should likely be swapped).
The text was updated successfully, but these errors were encountered: