add new data source: Ember #327

pweigmann · 2022-10-14T13:34:08Z

Includes new readSource() and convert() function for the yearly electricity data set from Ember:
https://ember-climate.org/data-catalogue/yearly-electricity-data/

The Ember data includes

electricity capacities which are added to calcCapacity(subtype = "ember"),
electricity generation which is used in a new function calcSE() and
emissions from electricity supply which are currently not used anywhere.

In this case, I tried to follow the madrat etiquette in dividing the Ember data set into the respective calcOutput() functions of the "category" of variables that are present (capacities, SE). However, other data sets contain a much wider range of variables making this approach very tedious and the code repetitive. In some of these case we created a calcOutput() function per source rather than per variable category (BP, HRE, ...).

Any ideas how to organize this in the future?

cchrisgong · 2022-10-14T13:40:02Z

Thanks! I'm sorry I didn't fully understand your question though.. I thought the logic is like one does the read and convert function based on sources (BP, Ember..), then this is called in something like calcCapacity and calcSE, where many sources can be converted and read in. Tagging @Renato-Rodrigues for a second opinion

pweigmann · 2022-10-14T13:45:00Z

Thanks! I'm sorry I didn't fully understand your question though.. I thought the logic is like one does the read and convert function based on sources (BP, Ember..), then this is called in something like calcCapacity and calcSE, where many sources can be converted and read in. Tagging @Renato-Rodrigues for a second opinion

Yes, that's the ideal. My point is, for some sources this isn't very practical because they have data for all variables and it would require a lot of repeated code in many functions. A "lighter" and quicker approach is to only have one calc-function for that source (which is what Falk and I and maybe also others did in the past...). Definitely curious about @Renato-Rodrigues opinion as well, thanks!

Renato-Rodrigues · 2022-10-14T14:40:59Z

I am not sure I am following 100% the discussion, but if it helps I always follow these principles when delaing wiht REMIND input data:

each data source should have a single read and convert function.
read and convert functions can have different subtypes if you want to treat different different data from the same source, ex: see for example readREMIND_11Regi.R and convertREMIND_11Regi.R.
the read function should return the data as close as possible from what we get directly from the data source.
the convert function should return the data in a way that makes sense to remind, i.e., disaggregated at country level and with variables and technology names as close as possible to the ones that we use as standard.
calcOutput functions serve mainly two purposes: (1) merge together different sources data that refer to the same topic, or (2) do the necessary transformations to create an input file for remind.

So, I would approach this in a different way.

I would create a readEmber function that has different subtypes (ex: "capacity", "demand", "generation", "imports", "emissions", "wholesale_price"). If all the data comes from the same place you can load all together no matter the subtype and just filter afterwards what you want to show.
I would create a convert function that fill the missing country values, and map if possible data using name conventions as close as possible to what we use. Ex: you could map under convertEmber(x,"capacity") "wind" values to "Cap|Wind", and so on. You can use a single mapping file to convert multiple subtype variables if you want.
Historical mif file creation could call directly readSource("Ember",subtype="capacity") for any ember specific values.
I would only add a ember reference to the calcCapacity function if an ember capacity information is important or has better quality to determine REMIND historical bounds used in the model, and in this case you would not need a new subtype for that as this is already included in an existent subtype in the function.

pweigmann · 2022-10-18T09:49:16Z

2. You can use a single mapping file to convert multiple subtype variables if you want.

Thanks for the input, I will do it the way you proposed! Somehow, I wasn't aware that the mapping step can be in the convert function, but it does make a lot of sense for me.

This also means, that all of the steps to bring the data in the right format so that it can be used in the historical.mif needs to be moved from the calcOutput function to the convert function. In this case, I don't see a problem here but am not sure if this is the "madrat-way" to do things?

Renato-Rodrigues · 2022-10-18T09:58:13Z

I am not sure what I wrote is 100% compatible with the "madrat way", but this was always my work flow for dealing with input data that can be used in the model.
If you want a second opinion on that you could ask somebody from the RSE group.

LaviniaBaumstark · 2022-10-20T09:11:14Z

Hi, in most parts @Renato-Rodrigues explained the madrat way. The only part which is a bit different is where teh mapping to REMIND-specific variables is happening. Mostly, we recommend adjusting only the spatial dimension in a convert* function (providing information for all ISO countries). The mapping (which can also include some calculations) should happen in a calc* funciton. If you do not want to repeat it in many other calc* functions using the same source, you can write a calc* function only for mapping variable names. This "intermediate" calc* function can than be used by all following calc* functions.

pweigmann · 2022-10-21T15:33:49Z

Hi, in most parts @Renato-Rodrigues explained the madrat way. The only part which is a bit different is where teh mapping to REMIND-specific variables is happening. Mostly, we recommend adjusting only the spatial dimension in a convert* function (providing information for all ISO countries). The mapping (which can also include some calculations) should happen in a calc* funciton. If you do not want to repeat it in many other calc* functions using the same source, you can write a calc* function only for mapping variable names. This "intermediate" calc* function can than be used by all following calc* functions.

I changed the structure of the functions and use a 'calcEmber()' with subtypes now. Is this more or less what you imagined @LaviniaBaumstark ?

LaviniaBaumstark · 2022-10-21T15:56:28Z

R/calcEmber.R

+
+  if (subtype == "capacity") {
+    # choose only capacity variables
+    x <- x[, , "GW"]


why do you need to treat "capacity" and "generation" special here and cannot always read in all?

ah this is already the calc-function - got it

LaviniaBaumstark · 2022-10-21T15:58:21Z

R/calcEmber.R

+#'
+#' @export
+
+calcEmber <- function(subtype = "all") {


maybe another name would help understanding, what is happening, e.g. calcEmberCleaned ?

So, I didn't plan to use another calc function for "Ember" but would just call calcOutput("Ember", subtype = "capacity") in calcCapacity() for example. The only thing left to do there would be to convert to TW.

Pascal Weigmann and others added 6 commits October 5, 2022 17:08

readEmber wip

a18b7c6

add historical Ember power capacities

45841c2

add electricity generation from Ember

f528d80

Merge branch 'master' of https://github.com/pik-piam/mrremind into ember

5743c0c

add Ember to historical.mif, increment version, fix linter warnings

2e1ef04

Merge branch 'pik-piam:master' into ember

4a12561

pweigmann requested review from mikapfl, giannou, cchrisgong, LaviniaBaumstark and fbenke-pik October 14, 2022 13:34

merge master

71d6018

Pascal Weigmann added 2 commits October 21, 2022 17:29

refactor Ember data preparation

d297a79

Merge branch 'ember' of https://github.com/pweigmann/mrremind into ember

a19ab1b

LaviniaBaumstark approved these changes Oct 21, 2022

View reviewed changes

pweigmann merged commit dd270f5 into pik-piam:master Oct 21, 2022

0UmfHxcvx5J7JoaOhFSs5mncnisTJJ6q mentioned this pull request Nov 4, 2022

There is no source Ember #330

Closed

pweigmann deleted the ember branch November 9, 2022 10:19

orichters mentioned this pull request Apr 3, 2023

capital / consumption jumps between Nov / Dec 2022 remindmodel/remind#1276

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add new data source: Ember #327

add new data source: Ember #327

pweigmann commented Oct 14, 2022

cchrisgong commented Oct 14, 2022

pweigmann commented Oct 14, 2022

Renato-Rodrigues commented Oct 14, 2022 •

edited

Loading

pweigmann commented Oct 18, 2022

Renato-Rodrigues commented Oct 18, 2022

LaviniaBaumstark commented Oct 20, 2022

pweigmann commented Oct 21, 2022

LaviniaBaumstark Oct 21, 2022

LaviniaBaumstark Oct 21, 2022

LaviniaBaumstark Oct 21, 2022

pweigmann Oct 21, 2022 •

edited

Loading

add new data source: Ember #327

add new data source: Ember #327

Conversation

pweigmann commented Oct 14, 2022

cchrisgong commented Oct 14, 2022

pweigmann commented Oct 14, 2022

Renato-Rodrigues commented Oct 14, 2022 • edited Loading

pweigmann commented Oct 18, 2022

Renato-Rodrigues commented Oct 18, 2022

LaviniaBaumstark commented Oct 20, 2022

pweigmann commented Oct 21, 2022

LaviniaBaumstark Oct 21, 2022

Choose a reason for hiding this comment

LaviniaBaumstark Oct 21, 2022

Choose a reason for hiding this comment

LaviniaBaumstark Oct 21, 2022

Choose a reason for hiding this comment

pweigmann Oct 21, 2022 • edited Loading

Choose a reason for hiding this comment

Renato-Rodrigues commented Oct 14, 2022 •

edited

Loading

pweigmann Oct 21, 2022 •

edited

Loading