-
-
Notifications
You must be signed in to change notification settings - Fork 120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merge EIA861 short-form transform #3660
Conversation
This reverts commit 574dbbc. remove index files
For more information, see https://pre-commit.ci
For more information, see https://pre-commit.ci
For more information, see https://pre-commit.ci
For more information, see https://pre-commit.ci
…wipe_and_reset migration
Okay @aesharpe @Nancy9ice it looks like we've got some real integration test failures here:
|
Okay, I'll look at it today. Is it the usual make pytest coverage command that I'll run to see if my changes solves this error? |
Hi Nancy! We've been meeting in person for the last couple of days, thanks for your patience. Yes, |
I understand. I've looked at the errors and it's because the values in the utility_id_eia and the balancing_authority_code_eia don't match values in the tables that also have these columns which is a bit odd. Also, there are null values in the num_water_heater columns. For this one, should I change the NULL values to zero instead? |
With the all NULL values for For the FK constraints, we might need to add some additional exceptions to the rules that are used to generate the constraints, since none of the EIA-861 tables are harvested -- i.e. they aren't used as the raw materials to construct the EIA Utility Entity table. I'm not sure where the balancing authority code FK is pointing though, since we don't have an entity table for the BAs. It might be that that column needs to be encoded / cleaned up in the short form table if it's got some bad / non-standard values in it. I think @cmgosnell is going to pop in and take a look. |
For the num_water_heaters, if you check the raw data files, they are actually NULL. For example, all the water_heater rows in the 2022 short form table are NULL |
hey @Nancy9ice ! thanks for this insight! I went ahead and ran the CI tests locally and got the same results. I also generated this table with all of the years and it looks like this column is always null! So I'm going to add a defensive check in the transform to check if its always null and then remove it from the table. And the FK problem is very expected with the 861 tables. This is an example of a quirk that don't have great documentation for ;-) I'll push some changes momentarily and hopefully we'll get this merged in. |
Overview
Closes #3540 and replaces @Nancy9ice PR #3565 to get all the CI to run because permissions are hard.
Testing
How did you make sure this worked? How can a reviewer verify this?
To-do list