Improve connection lock handling; always use context manager #1895

dpkp · 2019-09-03T02:27:49Z

This is an extension of #1851 and #1854 -- the underlying issue is that several methods are not handling acquisition and release of the connection lock correctly. I initially tried to address this through more use of self._lock.release(), but after sitting on this change for a while I think I agree that we should rely on the with self._lock: context manager so that the blocks that are synchronized are more obvious and also are resilient to uncaught exceptions.

I continue to believe that we should use a Lock, not an RLock, and one major reason for that is that the close() method calls back into the KafkaClient state change handler. That handler currently acquires the client lock, and so to avoid deadlocks I think we need to make sure that we no longer hold the connection lock when this handler is invoked. If we were to use an RLock, we could not be sure whether releasing the lock fully releases it or not (i.e., we only release our contextual hold, but some outer context may continue to hold the lock at a higher level).

Because of that, we also need to be careful to release the connection lock before we call self.close() and also before we call future.success() or future.failure() because these may trigger callback/errback functions that themselves call close or some other method that may attempt to acquire the conn lock. So I restructured many of the affected blocks to move future failure and close handling out of the lock context manager. This makes the code a bit more difficult to read and maintain, but I think it is necessary at this stage. I'd welcome refactoring attempts for sure, and will continue to think about better approaches to this structure that can help us sane.

This change is

Improve connection lock handling; always use context manager

8e3f299

dpkp requested a review from jeffwidman September 3, 2019 02:27

fixup var name for consistency

3d6c7d6

dpkp merged commit 7a69952 into master Sep 3, 2019

dpkp deleted the kafka_conn_with_lock branch September 3, 2019 15:47

dpkp mentioned this pull request Sep 3, 2019

conn: fix BrokerConnection unreleased lock issues #1851

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve connection lock handling; always use context manager #1895

Improve connection lock handling; always use context manager #1895

dpkp commented Sep 3, 2019 •

edited

Loading

Improve connection lock handling; always use context manager #1895

Improve connection lock handling; always use context manager #1895

Conversation

dpkp commented Sep 3, 2019 • edited Loading

dpkp commented Sep 3, 2019 •

edited

Loading