Adding codecov causes spurious PR check failures #2727

jaraco · 2021-07-11T14:37:41Z

In #2693, this project added codecov back in. Following that addition, I created a pull request (#2726) that didn't change any line of code but still somehow caused a check failure (indicating that code coverage was reduced by .04% and causing the checks to fail.

I'm hoping @tanvimoharir or @webknjaz can figure out how to either:

fix the root cause of these spurious check failures, or
devise a project-agnostic way to make the checks less sensitive (allowed to pass unless there are significant regressions in coverage).

I guess I'd also entertain simply disabling the ability for the checks to fail (report coverage, but never fail), which would be comparable to the prior behavior.

The reason I want the solution to be project-agnostic is because I'd like to be able to apply this technique across a suite of projects and I don't want each project to have to custom-tune the coverage in order to reach a reasonably quiet check. It's okay if a project wants to tune to be more sensitive, but the default should be to produce far less noise than signal.

tanvimoharir · 2021-07-14T09:24:16Z

It does show coverage change:
https://app.codecov.io/gh/pypa/setuptools/compare/2726/changes

I will try to find the root cause (referring https://docs.codecov.com/docs/unexpected-coverage-changes)

tanvimoharir · 2021-07-22T16:18:22Z

It seems like there are around 7 lines under setuptools/command/test.py which is causing this difference:
If you see lines 241-246 and 248 at
https://app.codecov.io/gh/pypa/setuptools/compare/2726/changes#D1L241

it shows that these lines were hit at the base but were not hit at head which is to say that disabling gcov introduced it?
I checked the build reports which were uploaded to codecov manually to confirm the same.
I'm trying to understand the test itself.

webknjaz · 2021-07-22T16:56:10Z

I think what happened was a result of some environment change in CI. It may've been caused by the test env deps being unpinned as well.

If you looks at the older coverage here https://codecov.io/gh/pypa/setuptools/src/3f20c10f04c7777a3dafa3033be05cc07d9e0bb0/setuptools/command/test.py#L241 and the newer one here https://codecov.io/gh/pypa/setuptools/src/ef9b8dd0b12de5833a7967bea8719cd33cea216a/setuptools/command/test.py#L241, you may notice that in the "better covered" commit, there are just 2 hits on those lines (and 0 in the following commits).

This makes me think that the if-clause was True only in 2 jobs (or maybe just one job but two tests hit it). Then, some transitive deps got updated, or the GHA VMs got updates, and that conditional became False in all of the envs we have.

The coverage drop looks legit. Also, I must note that the coverage is not measured by Codecov, it's only reported there. One way to improve the debugability would be to store those XML reports as artifacts in the CI so we could download them and check locally next time this happens. We could also produce HTML coverage to include better context information on what lines were hit by which tests and so on.

webknjaz · 2021-07-22T16:59:58Z

https://codecov.io/gh/pypa/setuptools/src/3f20c10f04c7777a3dafa3033be05cc07d9e0bb0/setuptools/command/test.py#L241 has flags at the top. I've toggled them and discovered that those lines were originally only covered by tests running against Python 3.6 under macOS. So that's what must've changed!

webknjaz · 2021-07-22T17:04:27Z

I compared https://github.com/pypa/setuptools/runs/3038459747 and https://github.com/pypa/setuptools/runs/3040476882 but this doesn't give me any clue about what changed — they seem identical (OS, CPython, libs are all the same).

webknjaz · 2021-07-22T17:05:15Z

FWIW it's possible to change the threshold in Codecov if you want it to be lower. It can be added to .codecov.yml that is already present in the project root.

jaraco · 2021-07-23T02:14:01Z

Yes. Lowering the threshold seems suitable. Even to zero would be fine and still enable reporting of coverage. Better would be to catch severe but real loss of coverage.

webknjaz · 2021-07-23T07:15:37Z

I'm pretty sure that this loss was real.

tanvimoharir · 2021-08-04T12:09:33Z

Should we go ahead with changing the threshold limit?

webknjaz · 2021-08-05T15:19:47Z

Yes. You may use this https://github.com/ansible-community/ansible-compat/blob/main/codecov.yml as a reference.

OTOH, it'd be useful to also figure out why that chunk stopped being covered and try to understand how to recover it.

tanvimoharir · 2021-08-30T16:55:01Z

Yes. You may use this https://github.com/ansible-community/ansible-compat/blob/main/codecov.yml as a reference.

OTOH, it'd be useful to also figure out why that chunk stopped being covered and try to understand how to recover it.

I have added the same configurations for now (in a draft PR)
How much should the threshold value generally be?
Yes, I will try to investigate further into the failures.

jaraco added a commit that referenced this issue Jul 18, 2021

Disable codecov. Ref #2727.

d5c86aa

tanvimoharir mentioned this issue Aug 30, 2021

Update codecov.yml #2762

Merged

2 tasks

jaraco closed this as completed in #2762 Oct 20, 2021

jaraco mentioned this issue Jan 30, 2022

[CI] Improve coverage config #3050

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding codecov causes spurious PR check failures #2727

Adding codecov causes spurious PR check failures #2727

jaraco commented Jul 11, 2021

tanvimoharir commented Jul 14, 2021

tanvimoharir commented Jul 22, 2021 •

edited

Loading

webknjaz commented Jul 22, 2021

webknjaz commented Jul 22, 2021

webknjaz commented Jul 22, 2021

webknjaz commented Jul 22, 2021

jaraco commented Jul 23, 2021 via email

webknjaz commented Jul 23, 2021

tanvimoharir commented Aug 4, 2021

webknjaz commented Aug 5, 2021

tanvimoharir commented Aug 30, 2021

Adding codecov causes spurious PR check failures #2727

Adding codecov causes spurious PR check failures #2727

Comments

jaraco commented Jul 11, 2021

tanvimoharir commented Jul 14, 2021

tanvimoharir commented Jul 22, 2021 • edited Loading

webknjaz commented Jul 22, 2021

webknjaz commented Jul 22, 2021

webknjaz commented Jul 22, 2021

webknjaz commented Jul 22, 2021

jaraco commented Jul 23, 2021 via email

webknjaz commented Jul 23, 2021

tanvimoharir commented Aug 4, 2021

webknjaz commented Aug 5, 2021

tanvimoharir commented Aug 30, 2021

tanvimoharir commented Jul 22, 2021 •

edited

Loading