
getting rid of a Func allocation #841

Merged: 4 commits into rabbitmq:master on May 20, 2020

Conversation

bollhals
Contributor

Proposed Changes

The AsyncConsumerWorkService was allocating a new Func for each call to Schedule (also spotted in the profiler images in #824), due to this Roslyn bug.
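An illustrative sketch (not the client's actual code) of the allocation pattern described above: a lambda that calls an instance method captures `this`, and the compiler does not cache such delegates, so each call to a hypothetical `Schedule` allocates a fresh Func.

```csharp
using System;
using System.Threading.Tasks;

// Hypothetical names for illustration only.
class AsyncScheduler
{
    public Task Schedule(int item)
    {
        // Allocates a new Func<int, Task> on every call: the lambda
        // closes over `this` via the instance method HandleAsync, so
        // Roslyn cannot hoist it into a cached static delegate.
        Func<int, Task> handler = i => HandleAsync(i);
        return handler(item);
    }

    private Task HandleAsync(int item) => Task.CompletedTask;
}
```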

Types of Changes

  • Bug fix (non-breaking change which fixes issue #NNNN)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause an observable behavior change in existing systems)
  • Documentation improvements (corrections, new content, etc)
  • Cosmetic change (whitespace, formatting, etc)

Checklist

  • I have read the CONTRIBUTING.md document
  • I have signed the CA (see https://cla.pivotal.io/sign/rabbitmq)
  • All tests pass locally with my changes
  • I have added tests that prove my fix is effective or that my feature works
  • I have added necessary documentation (if appropriate)
  • Any dependent changes have been merged and published in related repositories

Further Comments

I haven't verified these changes myself; I stumbled upon this while looking for something else and quickly edited it in the GitHub web UI only.

@lukebakken lukebakken self-assigned this May 20, 2020
@lukebakken lukebakken added this to the 6.1.0 milestone May 20, 2020
@bollhals
Contributor Author

You learn something new every day: it turns out you can't field-assign a lambda that calls an instance method. :)
Moved the initialization to the new ctor and also moved _workPool down there to keep it all in one place.
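A minimal sketch of the fix described above, with illustrative names: a field initializer cannot reference an instance member (compiler error CS0236), so the cached delegate is assigned in the constructor instead, giving one allocation per instance rather than one per call.

```csharp
using System;
using System.Threading.Tasks;

// Hypothetical names for illustration only.
class WorkPoolScheduler
{
    // private readonly Func<int, Task> _startWork = StartWorkAsync;
    // ^ CS0236: a field initializer cannot reference the non-static member StartWorkAsync.
    private readonly Func<int, Task> _startWork;

    public WorkPoolScheduler()
    {
        // Assigning here caches a single delegate for the lifetime of the
        // instance instead of allocating a new Func on every Schedule call.
        _startWork = StartWorkAsync;
    }

    public Task Schedule(int item) => _startWork(item);

    private Task StartWorkAsync(int item) => Task.CompletedTask;
}
```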

@lukebakken
Contributor

@bollhals
Contributor Author

True, that's actually a new bug that you've fixed as well. 👍
If it's on the hot path, the question is how "bad" the lock is for all the gets, since we'd really only need it for the add case. Are there alternatives?

@bording
Collaborator

bording commented May 20, 2020

@lukebakken Taking a quick look at this, I'm not sure that remark is really relevant here. It seems like it would only matter if we expect the same model instance to be starting work pools from different threads.

We definitely should be avoiding adding a lock in this path, but it's not clear to me that we need it.

@lukebakken
Contributor

OK, I'll take your word for it @bording ... I just traced the code a bit and it made my eyes cross. Only connections are supposed to be thread-safe so people abusing IModel instances are warned.

@lukebakken
Contributor

Of course, the question is why is a ConcurrentDictionary being used in the first place 🤔

@bording
Collaborator

bording commented May 20, 2020

OK, I'll take your word for it @bording ... I just traced the code a bit and it made my eyes cross. Only connections are supposed to be thread-safe so people abusing IModel instances are warned.

This was just a quick look, so don't take what I said as gospel! I'm just trying to think of how you'd have the same model instance calling Schedule from multiple threads in a valid scenario, and I didn't immediately come up with one.

@bording
Collaborator

bording commented May 20, 2020

Of course, the question is why is a ConcurrentDictionary being used in the first place 🤔

We do need to protect against multiple models adding work at the same time and writing to a regular dictionary from multiple threads would require a lock to avoid corrupting the collection.

@lukebakken
Contributor

You can see why that is confusing... "No need for a lock here, but we do in this other place in the same class"

@bording
Collaborator

bording commented May 20, 2020

It seems like we could look at just removing a shared work service as a concept and let each model have its own work pool, removing the need for a collection at all.

@bording
Collaborator

bording commented May 20, 2020

You can see why that is confusing... "No need for a lock here, but we do in this other place in the same class"

Where is there a lock already?

@lukebakken
Contributor

Where is there a lock already?

The use of a ConcurrentDictionary.

I'll add the per-model work pools as a future idea.

@michaelklishin
Member

So we removed some allocations but introduced a lock. @stebet can you please run your profiling workload and share if this is a net positive change in terms of CPU usage and allocations?

@lukebakken
Contributor

lukebakken commented May 20, 2020

So we removed some allocations but introduced a lock

The lock is already gone!

@bording
Collaborator

bording commented May 20, 2020

The use of a ConcurrentDictionary.

Hmm, I wouldn't have classified that as a lock, but I see what you mean.

The distinction here is that we need a ConcurrentDictionary because the collection itself could be modified concurrently, whereas the GetOrAdd issue is about adding the same key to that collection from separate threads, and whether the func being called more than once is a problem.

In this case, we'd end up creating more than one work pool, only one of which would be stored in the collection. The other one would never have work queued to it, so it doesn't seem like it could interfere in any way; it would simply await forever for a dequeue that never completes.
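A sketch of the GetOrAdd behavior being discussed, with illustrative names: the value factory runs outside the dictionary's internal locks, so under contention it can run more than once, even though only one created value is ever stored and returned to all callers.

```csharp
using System;
using System.Collections.Concurrent;
using System.Threading;

// Hypothetical names for illustration only.
public class WorkPool { }

public static class WorkPoolRegistry
{
    private static readonly ConcurrentDictionary<string, WorkPool> _pools =
        new ConcurrentDictionary<string, WorkPool>();

    public static int FactoryCalls;

    public static WorkPool GetPool(string model)
    {
        return _pools.GetOrAdd(model, _ =>
        {
            // Under a race, two threads may both run this factory;
            // the "losing" pool is never stored and never receives work.
            Interlocked.Increment(ref FactoryCalls);
            return new WorkPool();
        });
    }
}
```

Single-threaded, the factory runs exactly once per key and repeated lookups return the same stored instance; the double-creation window only opens under concurrent first access.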

@lukebakken
Contributor

In this case, we'd end up creating more than one work pool, only one of which would be stored in the collection. The other one would never have work queued to it, so it doesn't seem like it could interfere in any way. It would await forever for the dequeue to return something.

Would the other work pool be leaked or, since there would be no reference to it, would it be cleaned up? My guess is it would be leaked, or the resources used by it leaked... emphasis on the word "guess" 🤷

@stebet
Contributor

stebet commented May 20, 2020

Yeah, I've wondered about this code a few times. I like @bording's suggestion to just have each Model keep its own work queue. I'll run the PR through the profiling tool now though.

@stebet
Contributor

stebet commented May 20, 2020

This does indeed work and results in a reduction in allocations. Good spot there @bollhals. It wasn't apparent to me where that sneaky Func<> allocation was coming from.

Before: [profiler screenshot]

After: [profiler screenshot]

@stebet
Contributor

stebet commented May 20, 2020

And on that note, I was thinking for 7.0 that it'd be a good idea to get rid of these dispatchers and just have dedicated Channel instances on the models that an async task would simply be reading from. That way the message deliveries would always run asynchronously, and there would be no need for a collection to keep track of these instances.
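A hedged sketch of that 7.0 idea, not the client's actual design: each model owns a Channel of work items drained by a single dedicated async reader, so deliveries always run off the caller's thread and no shared collection tracks the pools. All names here are illustrative.

```csharp
using System;
using System.Threading.Channels;
using System.Threading.Tasks;

// Hypothetical per-model work queue, for illustration only.
public class ModelWorkQueue
{
    private readonly Channel<Func<Task>> _channel =
        Channel.CreateUnbounded<Func<Task>>(new UnboundedChannelOptions { SingleReader = true });
    private readonly Task _reader;

    public ModelWorkQueue()
    {
        // One dedicated reader per model; no dictionary of pools needed.
        _reader = Task.Run(ReadLoopAsync);
    }

    public void Enqueue(Func<Task> work) => _channel.Writer.TryWrite(work);

    public Task CompleteAsync()
    {
        // Signal no more work, then wait for the reader to drain.
        _channel.Writer.Complete();
        return _reader;
    }

    private async Task ReadLoopAsync()
    {
        // Items are executed one at a time, always asynchronously
        // with respect to the enqueuing thread.
        await foreach (Func<Task> work in _channel.Reader.ReadAllAsync())
        {
            await work();
        }
    }
}
```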

@lukebakken
Contributor

Thanks @bollhals and everyone else. Interesting discussion!

@lukebakken lukebakken merged commit a6bd976 into rabbitmq:master May 20, 2020
@lukebakken lukebakken added the next-gen-todo If a rewrite happens, address this issue. label May 20, 2020
@bollhals bollhals deleted the patch-1 branch May 21, 2020 14:32