Line that will be in maintenance next time step are not taken into account in the "simulate" function #148

DesmondZhong · 2020-09-15T23:43:41Z

Environment

Grid2op version: 1.2.2
System: Archlinux

Bug description

Line maintenance seems to be treated as an attack, which is reflected in the attack duration.

How to reproduce

Code snippet

from lightsim2grid import LightSimBackend
from grid2op import make
backend = LightSimBackend()
env = make("l2rpn_neurips_2020_track1_small", backend=backend, difficulty="0")
env.seed(3) # for reproducibility
obs = env.reset()

def print_obs(obs):
    print(f"line status: {obs.line_status}")
    print(f"attack_duration: {obs.time_before_cooldown_line}")
    print(f"time next maintenance {obs.time_next_maintenance}")
    print(f"maintenance duration {obs.duration_next_maintenance}")

print("\n-------------initial observation----------------\n")
print_obs(obs) 
# from the observation, we know line 18 is scheduled for maintenance 
# in 684 time steps, we then do nothing for 683 time steps

from grid2op.Agent import DoNothingAgent
do_nothing_agent = DoNothingAgent(env.action_space)

# do nothing for 683 time steps
for i in range(683):
    obs, reward, done, info = env.step(do_nothing_agent.act(observation=None, reward=None))

print("\n-------------observation after 683 steps-------------\n")
print_obs(obs)
# notice now line 18 is still connected in the power grid

# first simulate one time step and actually step one time step
sim_obs, sim_reward, sim_done, sim_info = obs.simulate(do_nothing_agent.act(observation=None, reward=None))
obs, reward, done, info = env.step(do_nothing_agent.act(observation=None, reward=None))

print("\n------------simulation-------------\n")
print_obs(sim_obs)
# notice in the simulation, line 18 is still connected
print("\n---------true------------\n")
print_obs(obs)
# notice in the actual step, line 18 is down, 
# and the attack_duration of line 18 change to 96, 
# which indicates line 18 is undergoing an attack

Current output

------------simulation-------------

line status: [ True  True  True  True  True  True  True  True  True  True  True  True
  True False  True  True  True  True  True  True  True  True  True False
  True  True  True  True  True  True  True  True  True  True  True  True
  True  True  True  True  True  True  True  True  True  True  True  True
  True  True  True  True  True  True  True  True False  True  True]
attack_duration: [ 0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
  0  0  0  0  0  0  0  0 31  0  0]
time next maintenance [3745   -1   -1   -1   -1   -1   -1   -1   -1 1729   -1   -1   -1   -1
   -1   -1   -1   -1    1   -1   -1   -1   -1   -1   -1   -1   -1   -1
   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1
   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1
   -1   -1   -1]
maintenance duration [96  0  0  0  0  0  0  0  0 96  0  0  0  0  0  0  0  0 96  0  0  0  0  0
  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
  0  0  0  0  0  0  0  0  0  0  0]

---------true------------

line status: [ True  True  True  True  True  True  True  True  True  True  True  True
  True False  True  True  True  True False  True  True  True  True False
  True  True  True  True  True  True  True  True  True  True  True  True
  True  True  True  True  True  True  True  True  True  True  True  True
  True  True  True  True  True  True  True  True False  True  True]
attack_duration: [ 0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0 96  0  0  0  0  0
  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
  0  0  0  0  0  0  0  0 31  0  0]
time next maintenance [3744   -1   -1   -1   -1   -1   -1   -1   -1 1728   -1   -1   -1   -1
   -1   -1   -1   -1    0   -1   -1   -1   -1   -1   -1   -1   -1   -1
   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1
   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1   -1
   -1   -1   -1]
maintenance duration [96  0  0  0  0  0  0  0  0 96  0  0  0  0  0  0  0  0 96  0  0  0  0  0
  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
  0  0  0  0  0  0  0  0  0  0  0]

Expected output

Please look at my comments in the code snippet. I found this when I try to test the behavior of obs.simulate, what I find is that the maintenance of duration 96 is treated as an attack of duration 96, which is indicated by the the attack_duration of power line 18. As I understand it, the true attacks are currently hard coded to have duration of 48. I think the above results indicates that line maintenance are treated as attacks in the environment and obs.simulate will not be able to predict a line maintenance as they are essentially attacks.

The text was updated successfully, but these errors were encountered:

BDonnot · 2020-09-16T08:34:02Z

Hello,

I think there is some misunderstanding here on the "obs.time_before_cooldown_line" that is not exactly the attack. Actually cooldown on line can come of 3 different manners:

the agent changed the status of the powerline
there is a maintenance
there is an attack

Cooldown only means "you cannot act on the status of this powerline for XXX steps"

So yes, you have the impression that Line maintenance seems to be treated as an attack, which is reflected in the attack duration. because you just looked at the cooldown, which covers also maintenance and actions.

However, you are right it appears maintenance, on the first time step, are not correctly taken into account into simulate (there is a difference of 1 time steps).
You can manually disconnect the powerlines when you "simulate" for example.

DesmondZhong · 2020-09-16T18:21:07Z

Thanks for your explanation! It's good to know that the behavior of simulate in 1.2.2 does not take maintenance into account and I guess it will remain as it is in the current competition environment.

Actually, I kind of like the wrong behavior of the "simulate" function since it could possibly make the my RL agent easier to code. I know from the definition of "simulate", you probably want to fix it to take maintenance into account. Maybe it is a good idea to make the fix as the default and retain the option of not considering maintenance as well. I don't know if other people want this feature or not.

Anyway, thanks for addressing this issue! I'll close it since it has been fixed.

Some improvments, mainly for gym_compat

DesmondZhong added the bug Something isn't working label Sep 15, 2020

BDonnot changed the title ~~line maintenance are treated as attack, unexpected behavior in obs.simulate~~ Line that will be in maintenance next time step are not taken into account in the "simulate" function Sep 16, 2020

BDonnot added a commit to BDonnot/Grid2Op that referenced this issue Sep 16, 2020

fixing issue Grid2op#148

580a584

DesmondZhong closed this as completed Sep 16, 2020

BDonnot added a commit that referenced this issue Feb 7, 2022

Merge pull request #148 from BDonnot/bd_dev

53035a1

Some improvments, mainly for gym_compat

BDonnot mentioned this issue Dec 12, 2022

Update to version 1.8.0 #384

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Line that will be in maintenance next time step are not taken into account in the "simulate" function #148

Line that will be in maintenance next time step are not taken into account in the "simulate" function #148

DesmondZhong commented Sep 15, 2020

BDonnot commented Sep 16, 2020

DesmondZhong commented Sep 16, 2020

Line that will be in maintenance next time step are not taken into account in the "simulate" function #148

Line that will be in maintenance next time step are not taken into account in the "simulate" function #148

Comments

DesmondZhong commented Sep 15, 2020

Environment

Bug description

How to reproduce

Code snippet

Current output

Expected output

BDonnot commented Sep 16, 2020

DesmondZhong commented Sep 16, 2020