Consistently use baseline expanded_income to fuzz reform results in dropq tables #1537

martinholmer · 2017-08-30T21:48:52Z

Motivation

This pull request was developed after looking at the problems faced by @andersonfrailey in pull request #1521. The goal of that pull request was simple enough and his skills are substantial, but the complexity of the dropq difference table logic made the task in #1521 quite challenging. I looked at the issue and couldn't figure out how to reach the goal of #1521. My conclusion was that the dropq difference table requirements were not that complicated, but that the code was unnecessarily complex, and therefore, difficult to follow.

What's Been Done?

This pull request is an attempt to simplify the code that generates dropq difference tables. The main strategy is to revise the create_difference_table utility function code to be general enough to support difference table creation in the dropq_utils.py file. This allows the removal of the dropq_diff_table function.

In the course of doing that, it was easy to add two features missing in the old dropq logic.

FIrst, it is now possible to create difference tables with an income measure that the TaxBrain difference table calls "Adjusted Income", which I assume is AGI. The "Adjusted Income" in TaxBrain button says "Not yet implemented" when the mouse is over the button. The high-level dropq logic in this pull request does not generate difference tables with AGI as the income measure, but doing so would now be straightforward.
Second, the dropq difference tables now include the tax difference expressed as a percent of baseline after-tax expanded income. However, TaxBrain would have to be revised to display that new difference statistic. This was the objective of pull request [WIP] Add After-Tax Income Percent Change column to dropq results #1521.

After doing this difference-table work, it seemed natural to continue and simplify the code that generates dropq distribution tables. The same basic strategy was pursued: generalize the create_distribution_table utility function and use it to replace the dropq_dist_table function.

Note this pull request includes all the changes in pull request #1534.

Consequences

After this pull request is merged, there will be fewer lines of code and hopefully less confusing and better documented dropq code.

The new dump option in dropq test_run_tax_calc_model has been used to check that all these code changes leave the results returned by the dropq run_nth_year_tax_calc_model function unchanged. The aggregate tax results are unchanged and all the baseline distribution table results are unchanged. However, there are some changes in the other table results. These changes occur in reforms where the policy reform causes the value of expanded_income to change for some filing units. The old code did not use baseline expanded_income to assign filing units to bins in reform-distribution-table or difference-table construction, while the new code does do that per the discussion in issue #1540.

Also, the new code standardizes DataFrame column names. So, now all six of the difference tables have the same column names and all four of the distribution tables have the same columns names. That was not always the case in the old code.

And finally, the aggregate table has been constructed by fuzzing just 3 filing unit records (instead of the 30 records).

So, to summarize the consequences of this pull request:

First, difference table results may differ from the results generated by the old code.
Second, distribution and difference table column names may differ from what they were in the old code.
Third, the aggregate tax totals for the reform and reform-baseline difference will be slightly more accurate.

These changes suggest that this pull request would trigger an API change, meaning the the next release would be the 0.11.0 version.

Subsequent Work

During the course of this work, it became clear that it wold be far less risky to return a dictionary of named results (rather than a unnamed tuple of results that depends on the order of the thirteen results). That change will be made in a subsequent pull request. That change would also constitute an API change, but one that would be easy for TaxBrain developers to handle according to this comment by @hdoupe.

Also, we are still not generating any distribution or difference tables using AGI as the income measure to assign filing units to decile or income bins. The TaxBrain GUI implies that this is a forthcoming feature. If that is still desirable, we can add AGI binning in a subsequent pull request.

@MattHJensen @feenberg @Amy-Xu @andersonfrailey @hdoupe @GoFroggyRun @brittainhard

codecov-io · 2017-08-30T21:57:53Z

Codecov Report

Merging #1537 into master will not change coverage.
The diff coverage is 100%.

@@          Coverage Diff           @@
##           master   #1537   +/-   ##
======================================
  Coverage     100%    100%           
======================================
  Files          37      37           
  Lines        2565    2558    -7     
======================================
- Hits         2565    2558    -7

Impacted Files	Coverage Δ
taxcalc/taxcalcio.py	`100% <100%> (ø)`	⬆️
taxcalc/utilsprvt.py	`100% <100%> (ø)`	⬆️
taxcalc/utils.py	`100% <100%> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 53c129a...f6b3e8a. Read the comment docs.

talumbau · 2017-09-01T01:05:38Z

taxcalc/dropq/dropq_utils.py

                                          income_measure='expanded_income',
                                          result_type='weighted_sum')
-    dist1_bin = create_distribution_table(df1, groupby='webapp_income_bins',
+    for col in [c for c in list(df2) if c.endswith('_xdec')]:
+        df2[col[:-5]] = df2[col]


This is probably a bit too concise and contains a magic number (why -5?). I think a more maintainable solution would be to:

make the list comprehension a local variable with a descriptive name

give "5" a descriptive variable name and add a comment on why the last five elements of "col" are chopped off.

You're absolutely correct. Commit 542cedb tries to make things more clear.

martinholmer · 2017-09-02T18:53:06Z

Pull request #1537 is now complete and an overview of the pull request is now available.

If there are no concerns or objections, pull request #1537 (which includes #1534) will be merged into the master branch at the end of the day on Wednesday, September 6th.

MattHJensen · 2017-09-05T12:58:33Z

taxcalc/dropq/dropq.py

@@ -1,6 +1,7 @@
 """
 The dropq functions are used by TaxBrain to call Tax-Calculator in order
-to maintain the privacy of the micro data being used by TaxBrain.
+to maintain the privacy of the micro data being used by TaxBrain.  This
+is done by adding random "fuzz" to the results in each table cell.


Possibly: "This is done by adding random "fuzz" to the sample from which the results in each table cell are drawn."

You're right. The description of what "fuzzing" means needs to be improved.
Commit 537b1c0 is an attempt to improve the documentation.

feenberg · 2017-09-05T20:53:36Z

On Tue, 5 Sep 2017, Martin Holmer wrote: @martinholmer commented on this pull request. ____________________________________________________________________________ In taxcalc/dropq/dropq.py: > @@ -1,6 +1,7 @@ """ The dropq functions are used by TaxBrain to call Tax-Calculator in order -to maintain the privacy of the micro data being used by TaxBrain. +to maintain the privacy of the micro data being used by TaxBrain. This +is done by adding random "fuzz" to the results in each table cell.

That isn't the best description of the method. can't we do better? Is it any harder to say "by dropping 3 randomly selected taxable returns from each table row"? dan

…

You're right. The description of what "fuzzing" means needs to be improved. Commit 537b1c0 is an attempt to improve the documentation. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.[AHvQVdXCIL9ULSVJJMlY4RR2H5m-zbtOks5sfZ_0gaJpZM4PIBfE.gif]

martinholmer · 2017-09-05T22:33:18Z

@feenberg said why can't we just say the dropq logic is

dropping 3 randomly selected taxable returns from each table row

Because the dropq logic does not drop any returns from a table row.

martinholmer · 2017-09-06T14:26:37Z

@MattHJensen and @feenberg, Both of you have raised concerns about the wording of the top docstring in the dropq.py file. Here is the latest wording:

The dropq functions are used by TaxBrain to call Tax-Calculator in order
to maintain the privacy of the IRS-SOI PUF data being used by TaxBrain.
This is done by "fuzzing" reform results for several randomly selected
filing units in each table cell.  The filing units randomly selected
differ for each policy reform and the "fuzzing" involves replacing the
post-reform tax results for the selected units with their pre-reform
tax results.

If you have any remaining concerns, please raise them now.
If part of the wording is incorrect or vague, I would appreciate your suggestions for alternative wording.

martinholmer · 2017-09-06T15:20:02Z

After one more review of pull request #1537, I have found myself wondering about one more question.

As I understand it, the basic approach in the dropq logic is to fuzz (or obscure) reform results for three randomly selected filing units in each table row. That means that when constructing decile tables, 30 units (three in each decile) are selected for fuzzing. And when constructing WEBAPP_INCOME_BINS tables, 36 units (three in each of the twelve WEBAPP_INCOME_BINS) are selected for fuzzing. We can confirm this with the results from the following adhoc dump of a test using a PUF subsample.

bin_type=dec
True     10746
False       30
Name: nofuzz, dtype: int64

bin_type=bin
True     10740
False       36
Name: nofuzz, dtype: int64

Notice that the totals in the decile tables will not be exactly the same as the totals in the BINS tables, but that is to be expected.

So far, I have no questions.

But what about the aggregate (or fiscal totals) table, which is shown at the top of the TaxBrain "Static Results" page? One way to view that table is that it is a single bin table. If we adopt that view, then shouldn't we be fuzzing just 3 units?

From a broader perspective, there appears to be four ways to construct the aggregate table amounts:

no fuzzing of reform results
fuzz reform results for just 3 filing units
fuzz reform results for the 30 filing units fuzzed in decile tables
fuzz reform results for the 36 filing units fuzzed in BINS tables

Looking at the code on the master branch, it uses the way described in 3.
And looking at the code in this pull request, it also uses the way described in 3.

But why not use the way described in 2? That approach is a consistent application of the rule of fuzzing three records in each table row. Or, is fuzzing the aggregate results not required from a privacy perspective? If not required, then the way described in 1 would be just fine.

So, my question is which of the four ways should be used in the aggregate table.

@MattHJensen @feenberg @Amy-Xu @andersonfrailey @hdoupe @GoFroggyRun

feenberg · 2017-09-06T17:23:02Z

Excellent.

…

On Wed, 6 Sep 2017, Martin Holmer wrote: @MattHJensen and @feenberg, Both of you have raised concerns about the wording of the top docstring in the dropq.py file. Here is the latest wording: The dropq functions are used by TaxBrain to call Tax-Calculator in order to maintain the privacy of the IRS-SOI PUF data being used by TaxBrain. This is done by "fuzzing" reform results for several randomly selected filing units in each table cell. The filing units randomly selected differ for each policy reform and the "fuzzing" involves replacing the post-reform tax results for the selected units with their pre-reform tax results. If you have any remaining concerns, please raise them now. If part of the wording is incorrect or vague, I would appreciate your suggestions for alternative wording. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.[AHvQVX7ykiYYw_vOtWP-RrBzPh2hVznAks5sfquegaJpZM4PIBfE.gif]

MattHJensen · 2017-09-06T19:26:10Z

@martinholmer, I like the new description of what 'fuzzing' is.

With regard to the liabilities table, I think we should fuzz 3 records from each cell rather than 30.

martinholmer · 2017-09-06T19:31:27Z

@MattHJensen said in #1537:

With regard to the [aggregate] liabilities table, I think we should fuzz 3 records from each cell rather than 30.

OK, I'll make one final change to #1537 to fuzz just three records when computing the aggregate tables.

@feenberg

feenberg · 2017-09-07T12:08:11Z

On Wed, 6 Sep 2017, Martin Holmer wrote: @MattHJensen said in #1537: With regard to the [aggregate] liabilities table, I think we should fuzz 3 records from each cell rather than 30. OK, I'll make one final change to #1537 to fuzz just three records when computing the aggregate tables.

Where did 30 come from? Is there something I should be aware of? dan

…

@feenberg — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.[AHvQVeii6zbhsyHR33pyPrcvK11b15qJks5sfvMQgaJpZM4PIBfE.gif]

MattHJensen · 2017-09-07T12:24:53Z

@feenberg asked:

Where did 30 come from? Is there something I should be aware of?

We were dropping the same records that were dropped from the decile table, 3 for each cell. Here is @martinholmer's initial comment about the issue.

martinholmer added 19 commits August 28, 2017 20:26

Streamline dropq and util diff-table code

624b4f6

Remove unneeded float casts in utilsprvt.py

ae319af

Move perc_aftertax calc into diff_table_stats utility function.

1d7fbf6

Move more diff table logic to diff_table_stats function

6113ebe

Move create_difference_table logic to diff_table_stats

aa86c8b

Use create_difference_table utility in dropq logic

29be90b

Rename function in utilsprvt.py

3a97e38

Complete renaming to weighted_perc_cut

6a1cdb5

Minor change in create_difference_table handling of input

0a45826

Change current_year ValueError to assert in utils.py

bb15e85

Merge in recent changes on master branch

4a0d609

Improve diff-table label for per_aftertax column

9925a3b

Nest diff_table_stats function in create_difference_table utility

8df23d2

Cosmetic consistency change from 1e99 to 9e99 in bins

df47734

Remove obsolete tests from test_dropq.py

7ce7357

Add new test in test_dropq.py

25c2762

Add stronger create_distribution_table tests

18f9a98

Remove baseline_obj and diff arguments from create_distribution_table

381e858

Remove obsolete dropq_dist_table tests

49b2f99

talumbau added the in progress label Aug 30, 2017

martinholmer added 5 commits August 31, 2017 09:34

Revise create_distribution_table arguments

246f41c

Merge branch 'master' into revise-dist-table

a95f04c

First step in fixing dropq fuzzing logic

1abdc6b

Second step in fixing dropq fuzzing logic

a918c1a

Third step in fixing dropq fuzzing logic

90432f0

talumbau reviewed Sep 1, 2017

View reviewed changes

martinholmer added 3 commits August 31, 2017 21:25

Clarify code in dropq_summary function

542cedb

Simplify dropq test_run_tax_calc_model

0b62f5a

Change add_*_bins function arguments

23a08a1

Revise a few dropq comments

a8594ee

Update RELEASES.md info

3dd22bc

MattHJensen reviewed Sep 5, 2017

View reviewed changes

martinholmer added 4 commits September 5, 2017 15:08

Merge branch 'master' into revise-dist-table

afa4a55

Better documentation what 'fuzzing' results in dropq means

537b1c0

Add doc and asserts to create_di*table utility functions

d3ae770

Clarify documentation in both dropq files

aed63de

martinholmer added 3 commits September 5, 2017 18:35

Consistently use baseline income measure for binning in dropq logic

7b79c22

Add test of new utils.py code

c62df24

Simplify nested fuzz function in dropq_utils.py

c920116

martinholmer changed the title ~~Use revised create_distribution_table instead of dropq_dist_table~~ Consistently use baseline expanded_income to fuzz reform results in dropq tables Sep 6, 2017

Edit top docstring in dropq.py

b1e54ef

martinholmer mentioned this pull request Sep 6, 2017

Which expanded_income to use in TaxBrain/dropq difference table bins? #1540

Closed

Construct dropq aggregate table by fuzzing just three records

f6b3e8a

martinholmer merged commit 7e3458f into PSLmodels:master Sep 6, 2017

talumbau removed the in progress label Sep 6, 2017

martinholmer mentioned this pull request Sep 6, 2017

[WIP] Add After-Tax Income Percent Change column to dropq results #1521

Closed

martinholmer deleted the revise-dist-table branch September 6, 2017 22:26

martinholmer mentioned this pull request Sep 7, 2017

Return a dictionary (not a tuple) from dropq run_nth_year_tax_calc_model #1543

Merged

martinholmer mentioned this pull request Oct 23, 2017

New Difference Table column ospc-org/ospc.org#709

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consistently use baseline expanded_income to fuzz reform results in dropq tables #1537

Consistently use baseline expanded_income to fuzz reform results in dropq tables #1537

martinholmer commented Aug 30, 2017 •

edited

Loading

codecov-io commented Aug 30, 2017 •

edited

Loading

talumbau Sep 1, 2017

martinholmer Sep 1, 2017

martinholmer commented Sep 2, 2017

MattHJensen Sep 5, 2017

martinholmer Sep 5, 2017

feenberg commented Sep 5, 2017 via email

martinholmer commented Sep 5, 2017

martinholmer commented Sep 6, 2017

martinholmer commented Sep 6, 2017

feenberg commented Sep 6, 2017 via email

MattHJensen commented Sep 6, 2017

martinholmer commented Sep 6, 2017

feenberg commented Sep 7, 2017 via email

MattHJensen commented Sep 7, 2017

Consistently use baseline expanded_income to fuzz reform results in dropq tables #1537

Consistently use baseline expanded_income to fuzz reform results in dropq tables #1537

Conversation

martinholmer commented Aug 30, 2017 • edited Loading

codecov-io commented Aug 30, 2017 • edited Loading

Codecov Report

talumbau Sep 1, 2017

Choose a reason for hiding this comment

martinholmer Sep 1, 2017

Choose a reason for hiding this comment

martinholmer commented Sep 2, 2017

MattHJensen Sep 5, 2017

Choose a reason for hiding this comment

martinholmer Sep 5, 2017

Choose a reason for hiding this comment

feenberg commented Sep 5, 2017 via email

martinholmer commented Sep 5, 2017

martinholmer commented Sep 6, 2017

martinholmer commented Sep 6, 2017

feenberg commented Sep 6, 2017 via email

MattHJensen commented Sep 6, 2017

martinholmer commented Sep 6, 2017

feenberg commented Sep 7, 2017 via email

MattHJensen commented Sep 7, 2017

martinholmer commented Aug 30, 2017 •

edited

Loading

codecov-io commented Aug 30, 2017 •

edited

Loading