
fix flaky test on test_dictionary.py::test_dictionary_looping #2105

Open
wants to merge 5 commits into main

Conversation

lonly7star

This PR aims to fix the flaky test test_dictionary.py::test_dictionary_looping so that the test can pass across multiple test runs.

The result

The test passes when run only once, but fails when the test suite is run multiple times. When asserting that the key pair does not exist in global_err_dicts or global_pairs, the pair already exists starting from the second test run.

Steps to reproduce the issue

  1. Install pytest-flakefinder with `pip install pytest-flakefinder`
  2. Run pytest with flake-finder: `pytest -k test_dictionary.py --flake-finder`

Issue of the code

The code asserts that each key pair or error read from the file does not already exist. However, global_err_dicts and global_pairs are only initialized once per pytest session, so starting from the second run of test_dictionary_looping, the assertion fails.
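
To make the failure mode concrete, here is a hypothetical, self-contained illustration. The names and data are placeholders, not the actual assertions in test_dictionary.py, which check pairs read from the dictionary files; the point is that module-level state is created once per pytest session, so a second pass over the same parameters trips the `not in` assertion.

```python
import pytest

# Module-level state: created once per pytest session, never reset.
global_pairs = set()


@pytest.mark.parametrize("pair", ["abandonned->abandoned", "abberation->aberration"])
def test_dictionary_looping(pair):
    # Passes on the first pass over the parameters; when pytest-flakefinder
    # re-runs the test in the same session, the pair is already present.
    assert pair not in global_pairs
    global_pairs.add(pair)
```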

Proposed solution

Add a variable that tracks how many txt files have been read, plus a pytest fixture that re-initializes both `global_err_dicts` and `global_pairs` when the variable indicates a new test run is starting (see the sketch below).
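
A minimal sketch of that idea, under assumptions: `global_err_dicts` and `global_pairs` stand in for the existing module-level containers, `NUM_DICTIONARIES` is a placeholder for the number of parametrized dictionary files, and the fixture and counter names are illustrative rather than the exact code in this PR.

```python
import pytest

global_err_dicts = set()   # placeholder for existing module-level state
global_pairs = set()
NUM_DICTIONARIES = 8       # placeholder for the number of parametrized files

files_read = 0             # how many parametrized cases have completed


@pytest.fixture()
def fresh_global_state():
    """Reset the shared sets only once a full pass over all files has finished,
    so a repeated run in the same session (e.g. under pytest-flakefinder)
    starts clean while test_ran_all still sees the accumulated state."""
    global files_read
    if files_read == NUM_DICTIONARIES:
        global_err_dicts.clear()
        global_pairs.clear()
        files_read = 0
    yield
    files_read += 1


def test_dictionary_looping(fresh_global_state):
    ...  # existing parametrized body, unchanged; it only requests the fixture
```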

@lonly7star (Author)

Should I re-format the code to meet codespell's formatting requirements and submit a new PR?

@peternewman (Collaborator)

This PR aims to fix the flaky test test_dictionary.py::test_dictionary_looping so that the test can pass across multiple test runs.

The test passes when run only once, but fails when the test suite is run multiple times. When asserting that the key pair does not exist in global_err_dicts or global_pairs, the pair already exists starting from the second test run.

I assume you've not had this test fail on a single run, so it's not flaky in the traditional sense but it's not compatible with re-running the test multiple times in the same session?

I'm not targeting this at you, but this fix seems like quite a bodge to me. Aren't there some setup and teardown functions for the test (rather than the test suite) that we could handle this in? Flaky ( https://github.com/box/flaky ) seems to support this sort of thing, based on issues like box/flaky#124, box/flaky#109, box/flaky#53 and box/flaky#39. Maybe pytest-flakefinder does too and we just need to add setup/tearDown, or add these to it?
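
For reference, the conventional per-test teardown would look roughly like the hedged sketch below (names are placeholders, not this project's code). As explained further down in the thread, this shape doesn't quite fit here: clearing after every parametrized case wipes the state that test_ran_all relies on.

```python
import pytest

global_err_dicts = set()   # placeholders for the module-level state
global_pairs = set()


@pytest.fixture()
def clean_state():
    yield
    # Teardown after each test: too early for this suite, because
    # test_ran_all inspects what test_dictionary_looping accumulated.
    global_err_dicts.clear()
    global_pairs.clear()
```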

Should I re-format the code to meet codespell's formatting requirements and submit a new PR?

You can just fix it in this PR if you'd prefer.

@lonly7star (Author)

I assume you've not had this test fail on a single run, so it's not flaky in the traditional sense but it's not compatible with re-running the test multiple times in the same session?

Thank you for replying to my PR! I'm a master's student trying to fix flaky tests as a course project. By definition, a flaky test is one that can produce different results without any change to the test code. In our case, a test function that fails when re-run multiple times in the same session is considered flaky.

Aren't there some setup and teardown functions for the test (rather than the test suite) that we could handle this in?

So a general teardown/fixture does not really fit test_dictionary_looping: it is a parametrized test and test_ran_all depends on its result. When we re-run either the full test file or just this test function, test_dictionary_looping will fail if we don't clean up after all of the parameters have been fed in, and since test_ran_all depends on the accumulated result, it will fail if we clean up too early.

That is why I used a counter for the loop count in a fixture, so I could clean up at the right time. On re-runs, using @pytest.mark.dependency only controls running priority; it does not guarantee that cleanup happens right after the desired number of loops, once all parameters have been fed in.
I only raised this PR from the perspective of flaky tests, and it is totally fine if you feel re-runs are not a concern for this project. Alternatively, we could discuss a manual tear-down that makes re-runs pass.
