Integrate `income_statement_ferc1` table #2147

cmgosnell · 2022-12-19T22:21:32Z

Overview

Closes #1813
This table is weird because it has two DBF tables flowing into one PUDL table.

Bespoke extract step that concatenates the two DBF tables together.
PUDL metadata resource and field definitions.
Row maps for the DBF row numbers (which account for 348 of the changed lines!).
Both the DBF and XBRL tables need a reshape (the DBF table has columns for each utility_type, while the XBRL table has multiple columns for each income_type).
I enabled align_row_numbers_dbf to take a list of dbf_table_names instead of a single dbf_table_name. This felt a little silly because this may be the only table that ever needs it, but it was very easy to implement and felt simpler than having two versions of align_row_numbers_dbf.
I had to make an overridden version of source_table_id because the Ferc1AbstractTableTransformer assumes there is only one source table. The new version uses the table name that was added during the extract step. We cooooould do this for all of the DBF tables, but we would need to always add the XBRL table name directly from the extract step. That all felt too... much for one table. The main quirk here is that source_table_id in the main assign_record_id method takes a source_ferc1 and a df. For all of the other tables the df does nada, but the income statement table needs it. The default method has kwargs, so this works okay but feels a little weird.
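
For anyone skimming the diff, here is a rough sketch of the pattern described above. It is not the actual PUDL code. The column names (`source_table`, `electric`, `gas`, `income`), the toy values, and the helper and class names are all assumptions made for illustration; only `source_table_id`, `assign_record_id`, `source_ferc1`, and the two DBF table names come from this PR and the linked issue.

```python
import pandas as pd

# Illustrative stand-ins for the two raw DBF tables; columns and values are made up.
RAW_DBF = {
    "f1_income_stmnt": pd.DataFrame(
        {"report_year": [2020], "row_number": [2], "electric": [100.0], "gas": [10.0]}
    ),
    "f1_incm_stmnt_2": pd.DataFrame(
        {"report_year": [2020], "row_number": [2], "electric": [5.0], "gas": [1.0]}
    ),
}


def concat_dbf_tables(tables):
    """Stack the raw tables, tagging each row with the table it came from."""
    tagged = [df.assign(source_table=name) for name, df in tables.items()]
    return pd.concat(tagged, ignore_index=True)


def reshape_utility_type(df):
    """Turn one column per utility type into one row per utility type."""
    return df.melt(
        id_vars=["source_table", "report_year", "row_number"],
        value_vars=["electric", "gas"],
        var_name="utility_type",
        value_name="income",
    )


class SingleTableTransformer:
    """Hypothetical default: exactly one source table, so df is ignored."""

    dbf_table = "f1_some_table"

    def source_table_id(self, source_ferc1, **kwargs):
        # Extra keyword arguments (such as df) are accepted and ignored.
        return self.dbf_table

    def assign_record_id(self, df, source_ferc1="dbf"):
        table = self.source_table_id(source_ferc1, df=df)
        key = (
            df["report_year"].astype(str)
            + "_"
            + df["row_number"].astype(str)
            + "_"
            + df["utility_type"]
        )
        return df.assign(record_id=table + "_" + key)


class IncomeStatementTransformer(SingleTableTransformer):
    """Hypothetical override: use the per-row table name added at extraction."""

    def source_table_id(self, source_ferc1, df, **kwargs):
        return df["source_table"]


combined = reshape_utility_type(concat_dbf_tables(RAW_DBF))
with_ids = IncomeStatementTransformer().assign_record_id(combined)
```

In this toy data both raw tables reuse the same `row_number`, so the IDs are only unique because the per-row table name is part of the key. That is the reason the override needs the `df` argument while the default method can ignore it.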

PR Checklist

Before requesting a review of your pull request, please make sure you've done the
following:

Merge the most recent version of dev (or the appropriate upstream branch) into
your branch and resolved any merge conflicts. You may need to do this several
times over the course of a PR as dev changes frequently.
Verify that all of the CI checks on your PR are passing. See
Running Tests with Tox
for details on how to run the full test suite locally if you need to debug a
particular failure.
Ensure that the docstrings for any new modules, classes, functions, or methods are
descriptive enough for developers and users to understand your code.
If you expanded data coverage or changed the outputs, ensure that the full
data validation tests
pass locally on a fresh DB.
If you've added new functions or classes, ensure that they have at least basic
unit tests.
If you've added new analyses, make sure they include defensive sanity checks that
will catch unexpected data issues.
Update the
release notes
to reflect your changes. Make sure to reference the PR and any related issues.
Do your own review of the PR. Add comments highlighting areas where you have
questions you'd like reviewers to answer, known issues, solutions you're
unsatisfied with, or other things that deserve special attention from the
reviewer.

src/pudl/extract/ferc1.py

src/pudl/metadata/fields.py

codecov · 2022-12-19T23:23:37Z

Codecov Report

Base: 85.3% // Head: 85.4% // Increases project coverage by +0.0% 🎉

Coverage data is based on head (cbff8fa) compared to base (c959d4b).
Patch coverage: 89.5% of modified lines in pull request are covered.

Additional details and impacted files

@@          Coverage Diff          @@
##             dev   #2147   +/-   ##
=====================================
  Coverage   85.3%   85.4%           
=====================================
  Files         73      73           
  Lines       8746    8777   +31     
=====================================
+ Hits        7469    7496   +27     
- Misses      1277    1281    +4

Impacted Files	Coverage Δ
src/pudl/metadata/fields.py	`100.0% <ø> (ø)`
src/pudl/metadata/resources/ferc1.py	`100.0% <ø> (ø)`
src/pudl/transform/params/ferc1.py	`100.0% <ø> (ø)`
src/pudl/transform/ferc1.py	`94.7% <88.0%> (-0.5%)`	⬇️
src/pudl/extract/ferc1.py	`86.0% <100.0%> (+0.2%)`	⬆️

src/pudl/package_data/settings/etl_full.yml

src/pudl/metadata/resources/ferc1.py

src/pudl/transform/ferc1.py

src/pudl/package_data/ferc1/dbf_to_xbrl.csv

zaneselvans · 2022-12-22T22:22:49Z

I guess we never figured out why you were having weird Numpy failures in the CI. If the builds pass would you bump the max Numpy version back up to <1.25?

zaneselvans

Hey these changes look good. Hopefully the numpy thing isn't an issue!

test/validate/ferc1_test.py

cmgosnell added 5 commits December 14, 2022 12:35

inital extract of income statement tbl

c784534

Merge branch 'comp_bal' into income

972cc57

begining of income statement transformers

0c056b6

Merge branch 'dev' into income

48cd2c0

first pass of transforms for income statement table

1854c53

cmgosnell added the ferc1, rmi, xbrl, and dbf labels Dec 19, 2022

cmgosnell self-assigned this Dec 19, 2022

Merge branch 'dev' into income

3d19870

cmgosnell commented Dec 19, 2022

src/pudl/extract/ferc1.py (outdated, resolved)

cmgosnell commented Dec 19, 2022

src/pudl/metadata/fields.py (outdated, resolved)

cmgosnell linked an issue Dec 19, 2022 that may be closed by this pull request

Transform f1_income_stmnt & f1_incm_stmnt_2 xbrl + dbf #1813

Closed

add income table to settings files (doh!)

8f43e9b

zaneselvans requested changes Dec 20, 2022

zaneselvans and others added 7 commits December 19, 2022 21:16

Roll back to numpy<1.24

eaae3fa

deal with duplicate income rows

fe1f790

Merge branch 'income' of github.com:catalyst-cooperative/pudl into in…

dacf80c

…come

Merge branch 'dev' into income

8175b59

add row type to dbf map

b4b2872

add unit test for multi-table dbf map filling

7ab267a

enable read_dbf_to_xbrl_map to read list of tables

daa1982

zaneselvans requested changes Dec 22, 2022

src/pudl/transform/ferc1.py (outdated, resolved)

src/pudl/transform/ferc1.py (outdated, resolved)

src/pudl/transform/ferc1.py (resolved)

src/pudl/transform/ferc1.py (outdated, resolved)

cmgosnell added 2 commits December 22, 2022 17:43

change numpy back to <1.25 and add assertion to assign_record_id

d980a56

Merge branch 'retained_earnings' into income

cbff8fa

zaneselvans approved these changes Dec 22, 2022

test/validate/ferc1_test.py (resolved)

cmgosnell merged commit 3094677 into dev Dec 23, 2022

cmgosnell deleted the income branch December 23, 2022 03:46
