Measure GIL contention in benchmarks #937

pentschev · 2023-03-24T12:30:59Z

Add argument to measure and report GIL contention in benchmarks. This may provide some useful insight when optimizing asyncio-related changes.

Sample results observed are below.

Small messages (1 B)

$ python -m ucp.benchmarks.send_recv --backend ucp-core --n-bytes 1 --n-iter 100_000 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.688429594039917

$ python -m ucp.benchmarks.send_recv --backend ucp-async --n-bytes 1 --n-iter 100_000 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.4772491455078125

$ UCXPY_NON_BLOCKING_MODE=1 python -m ucp.benchmarks.send_recv --backend ucp-async --n-bytes 1 --n-iter 100_000 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.8763511180877686

Medium messages (1 MiB)

$ python -m ucp.benchmarks.send_recv --backend ucp-core --n-bytes 1MiB --n-iter 100 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.1721574366092682

$ python -m ucp.benchmarks.send_recv --backend ucp-async --n-bytes 1MiB --n-iter 100 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.25215959548950195

$ UCXPY_NON_BLOCKING_MODE=1 python -m ucp.benchmarks.send_recv --backend ucp-async --n-bytes 1MiB --n-iter 100 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.2693021595478058

Large messages (1 GiB)

$ python -m ucp.benchmarks.send_recv --backend ucp-core --n-bytes 1 GiB --n-iter 10 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.5908554792404175

$ python -m ucp.benchmarks.send_recv --backend ucp-async --n-bytes 1GiB --n-iter 10 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.6251853108406067

$ UCXPY_NON_BLOCKING_MODE=1 python -m ucp.benchmarks.send_recv --backend ucp-async --n-bytes 1GiB --n-iter 10 --no-detailed-report --report-gil-contention
...
GIL contention            | 0.832139790058136

Add argument to measure and report GIL contention in benchmarks. This may provide some useful insight when optimizing asyncio-related changes.

quasiben · 2023-03-24T13:50:57Z

This is really cool! Can you describe what the output results are ? Is it time, a ratio, etc ?

pentschev · 2023-03-24T14:01:03Z

It is a ratio of the perceived time the user code (in this case, UCX-Py send/recv messages) had the GIL. Thus 0.0 would mean the user code never took the GIL, whereas 1.0 would mean the user code had the GIL for the entirety of time without releasing it.

jacobtomlinson · 2023-03-24T14:25:35Z

This looks neat! The Dask dashboard has a plot for this too, it's under the "more" list and is called "Contention". There were some performance implications with gilknocker==0.3.0 but seems much better in 0.4.0.

pentschev · 2023-03-24T14:52:37Z

This looks neat! The Dask dashboard has a plot for this too, it's under the "more" list and is called "Contention". There were some performance implications with gilknocker==0.3.0 but seems much better in 0.4.0.

Yes, Ben pointed me to that, and that's where I stole the idea from.

copy-pr-bot · 2023-10-13T08:06:43Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

pentschev · 2023-10-13T08:32:57Z

/ok to test

wence-

Minor nits, looks good though.

Interested to know how contended things are.

ucp/benchmarks/backends/tornado.py

ucp/benchmarks/backends/ucp_core.py

ucp/benchmarks/send_recv.py

Co-authored-by: Lawrence Mitchell <[email protected]>

pentschev · 2023-10-13T13:50:58Z

Interested to know how contended things are.

Not sure if you mean something different, but I reported contention numbers in the description. Did you overlook that or are you asking for something else?

pentschev · 2023-10-13T13:57:54Z

/ok to test

wence- · 2023-10-13T13:58:35Z

Interested to know how contended things are.

Not sure if you mean something different, but I reported contention numbers in the description. Did you overlook that or are you asking for something else?

Oh no, I'm just blind...

pentschev · 2023-10-13T14:55:35Z

Thanks @wence- !

pentschev · 2023-10-13T14:55:40Z

/merge

Measure GIL contention in benchmarks

8eaf1e7

Add argument to measure and report GIL contention in benchmarks. This may provide some useful insight when optimizing asyncio-related changes.

pentschev requested a review from a team as a code owner March 24, 2023 12:30

pentschev changed the base branch from branch-0.31 to branch-0.35 October 13, 2023 08:04

Merge branch 'branch-0.35' into benchmark-gil-contention

2accb9c

wence- approved these changes Oct 13, 2023

View reviewed changes

ucp/benchmarks/backends/tornado.py Outdated Show resolved Hide resolved

ucp/benchmarks/backends/ucp_core.py Outdated Show resolved Hide resolved

ucp/benchmarks/send_recv.py Outdated Show resolved Hide resolved

Remove duplicate stop() calls and rename to --report-gil-contention

700af6b

Co-authored-by: Lawrence Mitchell <[email protected]>

Merge branch 'branch-0.35' into benchmark-gil-contention

780d9ae

rapids-bot bot merged commit 9c17700 into rapidsai:branch-0.35 Oct 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Measure GIL contention in benchmarks #937

Measure GIL contention in benchmarks #937

pentschev commented Mar 24, 2023

quasiben commented Mar 24, 2023

pentschev commented Mar 24, 2023

jacobtomlinson commented Mar 24, 2023 •

edited

Loading

pentschev commented Mar 24, 2023

copy-pr-bot bot commented Oct 13, 2023

pentschev commented Oct 13, 2023

wence- left a comment

pentschev commented Oct 13, 2023

pentschev commented Oct 13, 2023

wence- commented Oct 13, 2023

pentschev commented Oct 13, 2023

pentschev commented Oct 13, 2023

Measure GIL contention in benchmarks #937

Measure GIL contention in benchmarks #937

Conversation

pentschev commented Mar 24, 2023

quasiben commented Mar 24, 2023

pentschev commented Mar 24, 2023

jacobtomlinson commented Mar 24, 2023 • edited Loading

pentschev commented Mar 24, 2023

copy-pr-bot bot commented Oct 13, 2023

pentschev commented Oct 13, 2023

wence- left a comment

Choose a reason for hiding this comment

pentschev commented Oct 13, 2023

pentschev commented Oct 13, 2023

wence- commented Oct 13, 2023

pentschev commented Oct 13, 2023

pentschev commented Oct 13, 2023

jacobtomlinson commented Mar 24, 2023 •

edited

Loading