Transition the `coverage-linux64` pipeline to Buildkite #41238

DilumAluthge · 2021-06-16T05:16:58Z

No description provided.

DilumAluthge · 2021-06-16T17:56:44Z

I've temporarily disabled the commit statuses. We can re-enable them right before we merge this PR.

In the meantime, go here to view statuses and build logs: https://buildkite.com/julialang/julia-coverage-linux64

DilumAluthge · 2021-06-16T23:51:14Z

@vtjnash Any idea why the coverage is lower here compared to the Buildbot approach? This PR calculates coverage at 79% (58650/73776), but Codecov (which was uploaded from Buildbot) calculates coverage at 88% (65731/75061).

This PR runs the tests in parallel. Could that be the sole reason for the discrepancy, or is there something else that I'm missing?

vtjnash · 2021-06-17T00:04:41Z

You should be able to go to codecov or coveralls and compare the two commits directly

DilumAluthge · 2021-06-17T00:05:57Z

You should be able to go to codecov or coveralls and compare the two commits directly

Ah, we'll have to add a Codecov token to Buildkite first - I'll need @staticfloat's help with that.

DilumAluthge · 2021-06-18T08:45:54Z

Hmmm, looks like it's failing: https://buildkite.com/julialang/julia-coverage-linux64/builds/68

staticfloat · 2021-06-18T16:48:01Z

So I have updated this to now correctly build Julia, and I've also updated the buildkite webui settings to not build in reaction to GH events, but to instead build on a nightly schedule.

staticfloat · 2021-06-18T17:05:06Z

The build is proceeding here: https://buildkite.com/julialang/julia-coverage-linux64/builds/85#8b067d64-5d75-4139-b75a-81289b567f47

DilumAluthge · 2021-06-18T17:06:43Z

I've also updated the buildkite webui settings to not build in reaction to GH events, but to instead build on a nightly schedule.

Nice! This sounds good.

If I want to e.g. trigger a manual job, based on the latest commit to master, without waiting for the next nightly build, will I still be able to trigger a manual coverage job through the Buildkite web UI?

DilumAluthge · 2021-06-18T17:07:39Z

┌ Info:
--
&nbsp; | │   ncores = 1
&nbsp; | │   Sys.CPU_THREADS = 128
&nbsp; | └   Threads.nthreads() = 1

We should set ncores to something more useful. 16?

staticfloat · 2021-06-18T17:13:04Z

will I still be able to trigger a manual coverage job through the Buildkite web UI?

Yep; just click the big green "new build" button in the top right of the pipeline UI.

We should set ncores to something more useful. 16?

Hmmm, yeah. What's the best way to do that? Define JULIA_NUM_THREADS globally, for all jobs? I guess if we really want to run a job single-threaded we can override that, and this allows us to scale our jobs based on the agent itself defining how much parallelism it can sustain.

DilumAluthge · 2021-06-18T17:34:48Z

Hmmm, yeah. What's the best way to do that? Define JULIA_NUM_THREADS globally, for all jobs? I guess if we really want to run a job single-threaded we can override that, and this allows us to scale our jobs based on the agent itself defining how much parallelism it can sustain.

Sounds good to me. Except I would define both JULIA_NUM_THREADS and JULIA_CPU_THREADS.

staticfloat · 2021-06-18T23:17:17Z

Looks like it worked!

DilumAluthge · 2021-06-18T23:21:32Z

Now we just need to uncommon the last line (# Coverage.Codecov.submit_local(fcs)) and make sure the upload works.

staticfloat · 2021-06-19T17:51:17Z

I rebased onto latest master and the tests broke. I regret everything.

DilumAluthge · 2021-06-19T22:24:08Z

I restarted the failing Windows Buildbots, and now all the Buildbots are passing.

Also, I ran a new Buildkite coverage-linux64 job, and if you look at the logs (https://buildkite.com/julialang/julia-coverage-linux64/builds/89), everything passed, including both the upload to Coveralls and the upload to Codecov.

It does look like coverage dropped approximately 6%. I'm guessing that is because of the tests being run in parallel.

In a future PR, I'll add a step that runs a subset of the tests in serial (specifically, the subset on which coverage dropped) so we can get coverage back up.

But I don't think that needs to block this PR.

DilumAluthge · 2021-06-20T05:55:32Z

From my point of view, this PR is good to go now.

) * Transition the `coverage-linux64` pipeline to Buildkite * Simplify, run inside of a sandbox * Upload coverage reports to Codecov and Coveralls * Add `COVERALLS_TOKEN` Co-authored-by: Elliot Saba <[email protected]>

* Transition the `coverage-linux64` pipeline to Buildkite * Simplify, run inside of a sandbox * Upload coverage reports to Codecov and Coveralls * Add `COVERALLS_TOKEN` Co-authored-by: Elliot Saba <[email protected]> (cherry picked from commit 9d5f31e)

vtjnash · 2021-09-09T00:39:36Z

Why is coverage being miscomputed since this PR was merged? https://codecov.io/gh/JuliaLang/julia/compare/6d2c0a7766142cbeb52c21494484038a558d9b33...9d5f31e9231c1d77e24ee820908e32f559e23057

It seems like the wrong flags must be getting passed to the test runners?

DilumAluthge · 2021-09-09T00:46:32Z

We may be passing the wrong flags. Here are the relevant scripts:

Also, we're running the tests in parallel. Buildbot used to run them in serial for coverage. So that may be missing some stuff.

@staticfloat Any ideas?

DilumAluthge · 2021-09-09T00:49:20Z

Hmmm. Here is one possibility.

So, we need to make sure we pass certain command-line flags, right? Including the following:

--code-coverage=all
--sysimage-native-code=no

Now, when we run the tests in parallel, we are using Base.runtests, which uses Distributed to launch workers. What if the above flags are not getting correctly forwarded to the Distributed workers?

DilumAluthge · 2021-09-09T00:50:54Z

For example, I don't think Distributed is correctly forwarding the --sysimage-native-code=no flag.

julia> Base.JLOptions().use_sysimage_native_code
0

julia> Distributed.@distributed vcat for i = 1:20
       Base.JLOptions().use_sysimage_native_code
       end
20-element Vector{Int8}:
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1
 1

vtjnash · 2021-09-09T00:53:14Z

I think test/testenv.jl forwards different flags? Also recommended is --code-coverage=lcov/tracefile-%p.info

c.f. https://github.com/JuliaCI/CoverageBase.jl/blob/master/examples/run_coverage_codecov.sh#L16

* Transition the `coverage-linux64` pipeline to Buildkite * Simplify, run inside of a sandbox * Upload coverage reports to Codecov and Coveralls * Add `COVERALLS_TOKEN` Co-authored-by: Elliot Saba <[email protected]> (cherry picked from commit 9d5f31e)

DilumAluthge added the ci Continuous integration label Jun 16, 2021

DilumAluthge force-pushed the dpa/buildkite-coverage-linux64 branch 4 times, most recently from 9f6d76e to b4375c8 Compare June 16, 2021 06:13

DilumAluthge changed the title ~~[WIP] Transition coverage-linux64 to Buildkite~~ [WIP] Transition the coverage-linux64 pipeline to Buildkite Jun 16, 2021

DilumAluthge force-pushed the dpa/buildkite-coverage-linux64 branch 7 times, most recently from 7171760 to 1d8fc2d Compare June 16, 2021 17:47

DilumAluthge force-pushed the dpa/buildkite-coverage-linux64 branch from 24b8553 to 6aa65ed Compare June 18, 2021 05:36

staticfloat force-pushed the dpa/buildkite-coverage-linux64 branch from 42ccc26 to 2610b80 Compare June 18, 2021 16:47

DilumAluthge changed the title ~~[WIP] Transition the coverage-linux64 pipeline to Buildkite~~ [WIP]Transition the coverage-linux64 pipeline to Buildkite Jun 18, 2021

DilumAluthge changed the title ~~[WIP]Transition the coverage-linux64 pipeline to Buildkite~~ Transition the coverage-linux64 pipeline to Buildkite Jun 18, 2021

Add COVERALLS_TOKEN

4b4e390

staticfloat force-pushed the dpa/buildkite-coverage-linux64 branch from 81f6a44 to 4b4e390 Compare June 19, 2021 17:35

DilumAluthge requested a review from staticfloat June 19, 2021 22:24

staticfloat merged commit 9d5f31e into master Jun 20, 2021

staticfloat deleted the dpa/buildkite-coverage-linux64 branch June 20, 2021 07:09

DilumAluthge mentioned this pull request Jul 8, 2021

CI: Issue to track which CI jobs have been migrated to Buildkite #41509

Closed

34 tasks

KristofferC added backport 1.6 Change should be backported to release-1.6 backport 1.7 labels Jul 19, 2021

KristofferC mentioned this pull request Jul 19, 2021

release-1.7: Backports for 1.7-RC1/1.7-beta4 #41499

Merged

82 tasks

KristofferC mentioned this pull request Jul 26, 2021

Backports for julia-1.6.3 #41554

Merged

75 tasks

KristofferC removed the backport 1.7 label Aug 3, 2021

KristofferC mentioned this pull request Aug 25, 2021

release-1.7: Backports for 1.7-RC1 #41781

Merged

63 tasks

KristofferC removed the backport 1.6 Change should be backported to release-1.6 label Sep 7, 2021

This was referenced Sep 9, 2021

Base.runtests: correctly forward the --sysimage-native-code=no flag if it is provided #42173

Closed

Base.julia_cmd(): correctly forward the --sysimage-native-code=no flag if it is provided #42185

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transition the `coverage-linux64` pipeline to Buildkite #41238

Transition the `coverage-linux64` pipeline to Buildkite #41238

DilumAluthge commented Jun 16, 2021

DilumAluthge commented Jun 16, 2021

DilumAluthge commented Jun 16, 2021 •

edited

Loading

vtjnash commented Jun 17, 2021

DilumAluthge commented Jun 17, 2021

DilumAluthge commented Jun 18, 2021

staticfloat commented Jun 18, 2021

staticfloat commented Jun 18, 2021

DilumAluthge commented Jun 18, 2021

DilumAluthge commented Jun 18, 2021

staticfloat commented Jun 18, 2021

DilumAluthge commented Jun 18, 2021

staticfloat commented Jun 18, 2021

DilumAluthge commented Jun 18, 2021

staticfloat commented Jun 19, 2021

DilumAluthge commented Jun 19, 2021 •

edited

Loading

DilumAluthge commented Jun 20, 2021

vtjnash commented Sep 9, 2021

DilumAluthge commented Sep 9, 2021

DilumAluthge commented Sep 9, 2021

DilumAluthge commented Sep 9, 2021

vtjnash commented Sep 9, 2021

Transition the coverage-linux64 pipeline to Buildkite #41238

Transition the coverage-linux64 pipeline to Buildkite #41238

Conversation

DilumAluthge commented Jun 16, 2021

DilumAluthge commented Jun 16, 2021

DilumAluthge commented Jun 16, 2021 • edited Loading

vtjnash commented Jun 17, 2021

DilumAluthge commented Jun 17, 2021

DilumAluthge commented Jun 18, 2021

staticfloat commented Jun 18, 2021

staticfloat commented Jun 18, 2021

DilumAluthge commented Jun 18, 2021

DilumAluthge commented Jun 18, 2021

staticfloat commented Jun 18, 2021

DilumAluthge commented Jun 18, 2021

staticfloat commented Jun 18, 2021

DilumAluthge commented Jun 18, 2021

staticfloat commented Jun 19, 2021

DilumAluthge commented Jun 19, 2021 • edited Loading

DilumAluthge commented Jun 20, 2021

vtjnash commented Sep 9, 2021

DilumAluthge commented Sep 9, 2021

DilumAluthge commented Sep 9, 2021

DilumAluthge commented Sep 9, 2021

vtjnash commented Sep 9, 2021

Transition the `coverage-linux64` pipeline to Buildkite #41238

Transition the `coverage-linux64` pipeline to Buildkite #41238

DilumAluthge commented Jun 16, 2021 •

edited

Loading

DilumAluthge commented Jun 19, 2021 •

edited

Loading