
Add test that compares tbi and std behavioral-response estimates #1840

Merged
merged 1 commit into from Jan 27, 2018
Conversation

martinholmer
Collaborator

@martinholmer martinholmer commented Jan 23, 2018

This pull request adds a new test to the test_tbi.py file; the test is marked pre_release, tbi_vs_std_behavior, and requires_pufcsv.

This test assumes _BE_sub equals 0.25 and compares the aggregate tax revenues generated by standard Python Tax-Calculator programming with those generated by calling the tbi.run_nth_year_tax_calc_model() function. The motivation for adding this test is the discussion in issue #1827.

Because the results generated by the tbi function call are "fuzzed" for PUF privacy reasons, there is no expectation that those results will be identical to the results generated by standard Tax-Calculator calls.
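To see why fuzzing rules out exact equality, here is a toy sketch of a small random relative perturbation applied to an aggregate. This assumes nothing about Tax-Calculator's actual PUF-privacy fuzzing algorithm, which is different; it only illustrates why exact equality cannot be expected while close agreement can:

```python
import random

def fuzz(value, seed=0, max_rel_noise=0.001):
    """Toy illustration only: apply a small random relative perturbation.
    Tax-Calculator's actual PUF-privacy fuzzing is different; this merely
    shows why exact equality between fuzzed and unfuzzed aggregates
    cannot be expected."""
    rng = random.Random(seed)
    return value * (1.0 + rng.uniform(-max_rel_noise, max_rel_noise))

aggregate = 1.0e12                      # a $1 trillion aggregate revenue
fuzzed = fuzz(aggregate)
assert fuzzed != aggregate              # not exactly equal ...
assert abs(fuzzed - aggregate) < 2.0e9  # ... but well within 0.2 percent
```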

The new test simulates a massive tax reduction caused by capping the regular-income and pass-through tax rates at no higher than 25 percent and raising the personal exemption from zero to $1000 beginning in 2019. This reform causes a substantial reduction in marginal tax rates and hence a substantial behavioral response.
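For illustration, a reform with those two features might look like the following in Tax-Calculator's year-keyed reform-dictionary format. The parameter names here (_II_rt5.._II_rt7 for regular-income rates, _PT_rt5.._PT_rt7 for pass-through rates, _II_em for the personal exemption) and the choice of which brackets to cap are my assumptions based on that era's API, not the test's actual reform:

```python
# Hypothetical sketch of a reform dictionary; parameter names and
# bracket choices are assumptions, not the test's actual reform.
reform = {
    2019: {
        # cap the upper regular-income tax-rate brackets at 25 percent
        "_II_rt5": [0.25],
        "_II_rt6": [0.25],
        "_II_rt7": [0.25],
        # cap the upper pass-through tax-rate brackets at 25 percent
        "_PT_rt5": [0.25],
        "_PT_rt6": [0.25],
        "_PT_rt7": [0.25],
        # raise the personal exemption from zero to $1000
        "_II_em": [1000],
    }
}

# sanity check: no capped rate exceeds 25 percent
assert all(v[0] <= 0.25 for k, v in reform[2019].items() if "_rt" in k)
```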

However, despite the discussion in #1827, there are no significant differences --- meaning differences of more than 0.2 percent in aggregate tax revenues --- between the results generated using the tbi function call and the results generated using standard (that is, non-tbi) function calls. Here is how I ran the test (using code at the tip of the master branch) and here is what I got:

$ cd taxcalc
$ py.test -m tbi_vs_std_behavior
============================= test session starts ==============================
platform darwin -- Python 2.7.14, pytest-3.2.1, py-1.4.34, pluggy-0.4.0
rootdir: /Users/mrh/work/OSPC/tax-calculator, inifile: setup.cfg
plugins: xdist-1.17.1
collected 436 items                                                             

tests/test_tbi.py .

============================= 435 tests deselected =============================
================== 1 passed, 435 deselected in 120.94 seconds ==================
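The 0.2 percent criterion described above can be sketched as a simple relative-difference check. This is an illustrative sketch (the function name and dollar figures are mine), not the actual test code:

```python
def within_tolerance(std_revenue, tbi_revenue, rel_tol=0.002):
    """Return True when the fuzzed tbi aggregate is within rel_tol
    (0.2 percent by default) of the standard aggregate."""
    return abs(tbi_revenue - std_revenue) <= rel_tol * abs(std_revenue)

# a $1.5 billion gap on a $1 trillion aggregate is 0.15 percent: passes
assert within_tolerance(1.0e12, 1.0015e12)
# a $5 billion gap is 0.5 percent: fails
assert not within_tolerance(1.0e12, 1.005e12)
```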

Why do these test results seem different from the results reported in #1827?
Perhaps I've made a mistake in writing the test.
Or perhaps the code in 0.14.2 is different from the code at the tip of the master branch.
Or perhaps mistakes were made in the work reported in #1827.

Does anybody have any ideas about this?

@MattHJensen @GoFroggyRun

@codecov-io

codecov-io commented Jan 23, 2018

Codecov Report

Merging #1840 into master will not change coverage.
The diff coverage is n/a.


@@          Coverage Diff           @@
##           master   #1840   +/-   ##
======================================
  Coverage     100%    100%           
======================================
  Files          37      37           
  Lines        3103    3103           
======================================
  Hits         3103    3103

Continue to review full report at Codecov.

Powered by Codecov. Last update c0beac4...5860fc3. Read the comment docs.

@GoFroggyRun
Contributor

GoFroggyRun commented Jan 23, 2018

@martinholmer If this test is based on current master, then things have changed a lot since version 0.14.2. Most importantly, current law in current master is TCJA; in version 0.14.2, on the other hand, current law was still pre-TCJA. If I am not mistaken, the tbi interface is only capable of calculating assumptions relative to current law.

@martinholmer
Collaborator Author

@GoFroggyRun said about pull request #1840:

If this test is based on current master, then things have changed a lot since version 0.14.2. Most importantly, current law in current master is TCJA; in version 0.14.2, on the other hand, current law was still pre-TCJA. If I am not mistaken, the tbi interface is only capable of calculating assumptions relative to current law.

So what? The main point is that you reported in #1827 that Tax-Calculator produced different results depending on whether you used standard (non-tbi) function calls or a call to the tbi.run_nth_year_tax_calc_model function. In the new #1840 test, the results generated from using Tax-Calculator in those two different ways are the same.

The questions I posed in my original #1840 comment are still unanswered. Can you help by answering them?

@GoFroggyRun
Contributor

GoFroggyRun commented Jan 23, 2018

@martinholmer asked:

Why do these test results seem different from the results reported in #1827?

Why does my answer not address your question?

  1. The current master differs essentially from 0.14.2 in terms of current law, and this affects how behavioral assumptions work via tbi.
  2. You are not using the exact reform file from #1827 (Are tbi behavioral-response results for TCJA reform different from non-tbi results?), which doesn't matter that much because of item 1.

Given these, why would you expect your test in #1840 to replicate the bug reported in #1827?

@MattHJensen
Contributor

MattHJensen commented Jan 23, 2018

I would be interested to know whether replicating the exact same baseline and reform from #1827 on the tip of the master branch also produces nonsensical results. That seems more relevant than knowing whether there was/is a bug in 0.14.1.
