New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Rts gmlc parser updates #209

Merged

bknueven merged 13 commits into grid-parity-exchange:main from darrylmelander:rts-gmlc-parser-updates

May 21, 2021

Collaborator

michaelbynum commented Feb 19, 2021 •

edited

Loading

From @darrylmelander in the previous PR:

The new RTS-GMLC parser is targeted at reading a full set of data in RTS-GMLC format, and then generating multiple Egret models for various time spans within the read-in data.

Here are some of the parsing details:

For branches, instead of using 'Cont Rating' for all three rating types, we now use:
- rating_long_term: Cont Rating
- rating_short_term: LTE Rating
- rating_emergency: STE Rating
For buses:
- the bus name (system['bus'][{bus name}]) is 'Bus Name' (was 'Bus ID')
- the bus "id" is now 'Bus ID' (was 'Bus Name')
- generators now refer to the bus by name instead of ID (system['generator'][{gen name}]['bus'] = {bus name})
Generators:
- We omit the 'CSP' generator type
- Because it only applies to CSP, we omit 'Natural Inflow' time series data
Reserves:
- Reserves are identified entirely by a naming convention. For system-wide reserves, we use this mapping:
RTS-GMLC Name Egret Name

Spin_Up spinning_reserve

Reg_Up regulation_up

Reg_Down regulation_down

Flex_Up flexible_ramp_up

Flex_Down flexible_ramp_down
- For area-specific reserves, we add a suffix to the system-wide names, namely "_R{Area Name}", such as "Spin_Up_RArea1" for the spinning reserve for an area named Area1.
- We don't get reserve information from anywhere other than timeseries_pointers.csv. We don't use reserves.csv. We could use the "Requirement (MW)" column in this file to populate the skeleton with static values, and then overwrite them if they have a timeseries associated with them. But we don't, so reserves are only included if they have a timeseries associated with them.

Collaborator Author

michaelbynum commented Feb 19, 2021

From the previous PR, @bknueven said:

We don't get reserve information from anywhere other than timeseries_pointers.csv. We don't use reserves.csv. We could use the "Requirement (MW)" column in this file to populate the skeleton with static values, and then overwrite them if they have a timeseries associated with them. But we don't, so reserves are only included if they have a timeseries associated with them.

I would be in favor of this -- it's not necessarily unusual to carry static amounts of certain types of reserves, and I believe Egret should already support having a single MW value for every reserve type.

darrylmelander force-pushed the rts-gmlc-parser-updates branch from c4bca29 to 850b3b9 Compare

April 8, 2021 23:34

bknueven self-requested a review

May 11, 2021 18:24

darrylmelander mentioned this pull request

Support for RTS-GMLC as input format grid-parity-exchange/Prescient#95

Merged

bknueven reviewed

View reviewed changes

Collaborator

bknueven left a comment

I attempted to load and solve an RTS-GMLC instance ... which proved to be somewhat troublesome. I believe I captured my issues in the comments.

egret/parsers/rts_gmlc/parser.py Show resolved Hide resolved

egret/parsers/rts_gmlc/parser.py Outdated

Comment on lines 71 to 97

+                  begin_time : datetime.datetime or str
+                      Beginning of time horizon. If str, date/time in "YYYY-MM-DD HH:MM:SS" or "YYYY-MM-DD" format,
+                      the later of which assumes a midnight start.
+                  end_time : datetime.datetime or str
+                      End of time horizon. If str, date/time in "YYYY-MM-DD HH:MM:SS" or "YYYY-MM-DD" format,
+                      the later of which assumes a midnight start.
+                  simulation : str
+                      Either "DAY_AHEAD" or "REAL_TIME", which specifies which time series the data is taken from,
+                      default is "DAY_AHEAD".
+                  t0_state : dict or Nonetype
+                      Keys of this dict are thermal generator names, each element of which is another dictionary with
+                      keys "initial_status", "initial_p_output", and "initial_q_output", which specify whether the
+                      generator is on at t0, the real power output at t0, and the reactive power output at t0.
+                      If this is None, default values are loaded.
+                  Returns
+                  -------
+                      dict : A dictionary in the format required for the ModelData object.
+                  """
+                  cache = parse_to_cache(rts_gmlc_dir, begin_time, end_time, t0_state)
+                  model = cache.generate_model(simulation, begin_time, end_time)

Collaborator

bknueven May 12, 2021

This function does not really support passing begin_time/end_time as a string because ParsedCache.generate_model wants them as datetime.datetime. I would suggest converting begin_time/end_time within this function.

Collaborator

darrylmelander May 19, 2021

Done

egret/parsers/rts_gmlc/parser.py Outdated

Comment on lines 69 to 70

		rts_gmlc_dir : str
		Path to RTS-GMLC directory

Collaborator

bknueven May 12, 2021 •

edited

Loading

The code seems to assume this is RTS-GMLC/RTS_Data/SourceData, where RTS-GMLC is the directory the RTS-GMLC repo is cloned to. We should update the docstring to reflect that.

Collaborator

darrylmelander May 19, 2021

Docstring updated

egret/parsers/rts_gmlc/parser.py Outdated


		load_participation_factors = _compute_bus_load_participation_factors(model_data)

		set_t0_data(model_data, rts_gmlc_dir, None)

Collaborator

bknueven May 12, 2021

I think we should be passing t0_state into this function in place of None

Collaborator

darrylmelander May 20, 2021

Fixed

egret/parsers/rts_gmlc/parser.py Outdated Show resolved Hide resolved

egret/parsers/rts_gmlc/parser.py Outdated

Comment on lines 80 to 84

+                  t0_state : dict or Nonetype
+                      Keys of this dict are thermal generator names, each element of which is another dictionary with
+                      keys "initial_status", "initial_p_output", and "initial_q_output", which specify whether the
+                      generator is on at t0, the real power output at t0, and the reactive power output at t0.
+                      If this is None, default values are loaded.

Collaborator

bknueven May 12, 2021

When None, we do not load default values, which we should do for convenience (perhaps with warning).

Collaborator

darrylmelander May 20, 2021

Fixed comment

egret/parsers/rts_gmlc/parser.py Outdated Show resolved Hide resolved

egret/parsers/rts_gmlc/parser.py Outdated Show resolved Hide resolved

egret/parsers/rts_gmlc/parser.py Outdated

Comment on lines 773 to 775

+                      if len(end_time) == len(datestr):
+                          end_time += midnight
+                      end_time = datetime.strptime(end_time,datetime_format)

Collaborator

bknueven May 12, 2021

Similar to above (probably want a function for this repeated code).

Collaborator

darrylmelander May 20, 2021

I created a function with the repeated code. I also switched from strptime with a fixed format to dateutil.parser.parse, which infers the date format from the string. See _convert_to_datetime().

egret/parsers/rts_gmlc/parser.py Outdated Show resolved Hide resolved

darrylmelander added 10 commits

May 19, 2021 14:09


          Updated RTS-GMLC parser.

28dba41

New parser is targeted at reading a full set of data in RTS-GMLC
format, and then generating multiple Egret models for various
time spans within the read-in data.


          Fixes to RTS-GMLC parser.

6258d99

* Return dict (not ModelData) from create_model_data_dict()
* Parse dates if provided as strings
* Don't multiply timeseries by their Scaling Factor


          Read constant reserve requirements from reserves.csv.

94904a9

They follow the same naming conventions as timeseries data.  The constant requirement values in reserves.csv are only used if there is not a corresponding timeseries in timeseries_pointers.csv; otherwise the constant value is replaced with timeseries values.


          Remove unused file

1e9d074


          Set initial generator state when reading RTS-GMLC.

164bb12

Initial state is taken from a passed in t0_state dict.  If none is
passed in, a file named "initial_status.csv" is read and data is pulled
from there.  That file can have 1, 2, or 3 data lines (initial_status,
initial_p_output, and initial_q_output).  It must have line 1, but if
it doens't have line 2 or 3, initial values are set to p_min and q_min.

Note that if you are caching the results of parsing, initial state is
not set (otherwise, every date range would end up with the same initial
state).  For models generated from the parsed cache, you can call
rts_gmlc_parser.set_t0_data() on the resulting model.

Also, treat ROR generators as HYDRO.


          Make some RTS-GMLC fields optional or more flexible.

b0d8a7c

Shunt-related columns can be omitted if they are never used.

There is a variable number of fuel-related columns.  The files in the RTS-GMLC repository
use 5 columns, but the last of these 5 columns is never used (it is always 'NA').  The
parser was hard-coded to read 4 columns and ignore the 5th.  While this works for the
repository files, it is overly rigid.  The new code allows any number of columns, using
whatever columns are present and appropriately populated for each generator.


          A parsed data cache now allows you to get the model skeleton and popu…

4316c9c

…late it in two separate steps.


          Fixes/tweaks to RTS-GMLC parser.

17226fa

* Zone name can be any string, not just integers
* Don't include startup_cost or p_cost. Include startup_fuel, p_fuel, and non_fuel_startup_cost instead.
* More flexibility in how startup fuel is specified when you have fewer than 3 points. It no longer matters which of the 3 startup fuel columns you leave blank in this case, as long as the provided data is consistent (cold is longer than hot, for example).
* More flexibility in how p_fuel fuel curves are specified. You can have any number of columns (up to 50). For a fuel curve with N valid points, you only have to fill in the first N fuel curve columns and leave the rest blank.  The number of points in the fuel curve can be different for each generator.
* Omit several properties that were in the original RTS-GMLC parser but have no meaning to Egret.
* Some optional properties are now left out of the JSON if the corresponding cells in the csv are left blank


          Don't include end_time in model instances returned from a parsed_cache

f18d482


          A few tweaks to rts-gmlc parser to keep prescient happy

43a95a1

darrylmelander mentioned this pull request

Remaining RTS-GMLC parsing features #229

Open

4 tasks


          Address issues found during review

689f359

darrylmelander force-pushed the rts-gmlc-parser-updates branch from 8e48435 to 689f359 Compare

May 20, 2021 22:55

bknueven and others added 2 commits

May 21, 2021 11:34


          ignoring dc_branches and adding default initial status (#1)

f29bd46


          Cleaning up a comment

dece0ee

bknueven approved these changes

View reviewed changes

bknueven merged commit 93a1345 into grid-parity-exchange:main

bknueven mentioned this pull request

adding missing __init__ file #230

Merged

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet