bluetooth: buf: Add a callback for freed buffer in rx pool #81646

PavelVPV · 2024-11-20T09:34:45Z

The Bluetooth data buffer API currently lacks a mechanism to notify when
a buffer is freed in the RX pool. This limitation forces HCI drivers to
adopt inefficient workarounds to manage buffer allocation.

HCI drivers face two suboptimal options:

Blocking calls: Use bt_buf_get_rx with K_FOREVER, which blocks the
execution context until a buffer becomes available.
Polling: Repeatedly call bt_buf_get_rx with K_NO_WAIT, which increases
CPU load and reduces efficiency.

This commit introduces a callback mechanism that is triggered each time
a buffer is freed in the RX pool. With this feature, HCI drivers can:

Call bt_buf_get_rx with K_NO_WAIT.
Wait for the callback notification if a NULL buffer is returned,
avoiding unnecessary polling.

The new callback improves efficiency by enabling event-driven behavior
for buffer management, reducing CPU overhead while maintaining
responsiveness.

alwa-nordic · 2024-11-20T14:17:27Z

We have gotten feedback about lack of thread safety in the Host. I think we should make sure all code we add is thread safe, or have the API documentation say exactly what to synchronize on to safely call the API, since we have committed to this.

I have sketched some (compile time tested) changes at alwa-nordic@03b0d39.

In the sketch, I chose to make it thread safe. I also considered that an application may collide with the driver and both will call the cb_set function. To prevent "silent corruption" in that case, I changed it from a single cb_set to a cb_register and cb_unregister.

Apart from this, I removed the dependency injection from buf.c to iso.c and instead make iso.c call buf.c directly. I think this simplifies the code (at the cost of adding a circular dependency).

alwa-nordic · 2024-11-20T14:19:41Z

I think we can assign single-bit values to enum bt_buf_type, and then we wouldn't need enum bt_buf_type_bit.

rugeGerritsen · 2024-11-20T15:11:06Z

We have gotten feedback about lack of thread safety in the Host. I think we should make sure all code we add is thread safe, or have the API documentation say exactly what to synchronize on to safely call the API, since we have committed to this.

I have sketched some (compile time tested) changes at alwa-nordic@03b0d39.

In the sketch, I chose to make it thread safe. I also considered that an application may collide with the driver and both will call the cb_set function. To prevent "silent corruption" in that case, I changed it from a single cb_set to a cb_register and cb_unregister.

Apart from this, I removed the dependency injection from buf.c to iso.c and instead make iso.c call buf.c directly. I think this simplifies the code (at the cost of adding a circular dependency).

Can you elaborate why we want bt_buf_rx_freed_cb_set() to be thread safe? AFAIK, the RX buffers are only used when passed from controller to host, so the application should not call this API.

alwa-nordic · 2024-11-20T15:23:14Z

Can you elaborate why we want bt_buf_rx_freed_cb_set() to be thread safe? AFAIK, the RX buffers are only used when passed from controller to host, so the application should not call this API.

Thread safe is the default, as stated in documentation. The alternative is to document the API's thread safety aspects. It may be right that this API does not have to be thread safe, I'm not arguing for or against this right now, but then the documentation of this API has to do some heavy lifting.

alwa-nordic · 2024-11-20T16:10:51Z

Food for thought: buf_rx_freed_notify as implemented is invoked from net_buf destroy callbacks. Those come from potentially arbitrary threads.

bt_buf_rx_freed_cb_t may have to be designated thread safe in case multiple threads call destroy concurrently.

Without synchronization with buf_rx_freed_notify, bt_buf_rx_freed_cb_set is safe to call only when the caller can guarantee a concurrent buf_rx_freed_notify is impossible. At what times can the caller infer this guarantee?

PavelVPV · 2024-11-21T06:53:12Z

We have gotten feedback about lack of thread safety in the Host. I think we should make sure all code we add is thread safe, or have the API documentation say exactly what to synchronize on to safely call the API, since we have committed to this.

Thread safe is the default, as stated in documentation. The alternative is to document the API's thread safety aspects. It may be right that this API does not have to be thread safe, I'm not arguing for or against this right now, but then the documentation of this API has to do some heavy lifting.

If you can come up with the scenario where this function is called from multiple contexts then I'll make it thread safe. But from my understanding there is only a single user of this API. More other, only 1 callback can be set, so calling this function from different contexts means that there is misunderstanding of the purpose of this API, which in my understanding should be called once at the initialization phase. Therefore, I'd rather update the API description.

PavelVPV · 2024-11-21T06:57:31Z

Food for thought: buf_rx_freed_notify as implemented is invoked from net_buf destroy callbacks. Those come from potentially arbitrary threads.

bt_buf_rx_freed_cb_t may have to be designated thread safe in case multiple threads call destroy concurrently.

Without synchronization with buf_rx_freed_notify, bt_buf_rx_freed_cb_set is safe to call only when > the caller can guarantee a concurrent buf_rx_freed_notify is impossible. At what times can the caller infer this guarantee?

With this I fully agree. I'll make the callback thread safe.

PavelVPV · 2024-11-21T06:58:07Z

Apart from this, I removed the dependency injection from buf.c to iso.c and instead make iso.c call buf.c directly. I think this simplifies the code (at the cost of adding a circular dependency).

I think I disagree with this suggestion precisely because it creates dependency of iso.c on buf.c. The dependency injection from buf.c to iso.c already exists.

alwa-nordic · 2024-11-21T09:43:09Z

If you can come up with the scenario where this function is called from multiple contexts then I'll make it thread safe. But from my understanding there is only a single user of this API. More other, only 1 callback can be set, so calling this function from different contexts means that there is misunderstanding of the purpose of this API, which in my understanding should be called once at the initialization phase. Therefore, I'd rather update the API description.

I don't have any scenario in mind. I am in agreement with you if we update the API documentation so the user is informed about when it's safe to call the API, and when it's safe to free the callback.

I would like any assumptions we make about our own code to be documented and treated like internal API. Let's see what assumptions we need to make, if any. We could put it in code comments or create an architecture design document.

I'll make the callback thread safe.

It's a good idea to document this requirement on bt_buf_rx_freed_cb_t.

tests/bluetooth/buf/src/main.c

alwa-nordic · 2024-11-26T12:08:38Z

tests/bluetooth/buf/src/main.c

+		zassert_equal(test_vector[i].exp_type_mask, freed_buf_type,
+			      "Unexpected buffer type");


This should not test for equality, but just the expected bit. Extra bits should not fail this test, right? Like for hci_raw that uses the same buffer pool for all pool types.

Assuming the above is done, exp_type_mask in the test vector is over-fitting to the implementation. We should get rid of exp_type_mask and instead just check that type bit is set in the callback-provided value.

I'd rather add a dedicated test for hci_raw that checks that it passes all 3 types. Then we know exactly what each variant returns.

It seems we want to test different things.

I want a test which accepts any implementation that follows the API specification. You want a test that fully specifies the behavior of the implementation with no wiggle room.

Both types of tests are useful. Your type is useful for enforcing strict refactoring without behavior change. My type is useful for testing multiple implementations or a re-implementation that takes advantage of the wiggle room in the specification.

Maybe we should have both tests? One test for the API, and one for each fully-specified implementation.

Then I suggest a different test where this wiggle room is not needed: 5f1f5d9

alwa-nordic · 2024-11-26T12:53:32Z

tests/bluetooth/buf/testcase.yaml

+      - native_sim
+      - native_sim/native/64
+    integration_platforms:
+      - native_sim


Should we add extra_configs here as well to make sure the config is as we expect, no matter what the default is?

Right, though if we check the exact returned value, this may not be needed. I added CONFIG_BT_HCI_ACL_FLOW_CONTROL=y any way.

jhedberg · 2024-12-09T14:37:58Z

@PavelVPV there's a conflict that needs resolving - please rebase.

This allows to combine several types in a single value. Signed-off-by: Pavel Vasilyev <[email protected]>

The Bluetooth data buffer API currently lacks a mechanism to notify when a buffer is freed in the RX pool. This limitation forces HCI drivers to adopt inefficient workarounds to manage buffer allocation. HCI drivers face two suboptimal options: - Blocking calls: Use bt_buf_get_rx with K_FOREVER, which blocks the execution context until a buffer becomes available. - Polling: Repeatedly call bt_buf_get_rx with K_NO_WAIT, which increases CPU load and reduces efficiency. This commit introduces a callback mechanism that is triggered each time a buffer is freed in the RX pool. With this feature, HCI drivers can: - Call bt_buf_get_rx with K_NO_WAIT. - Wait for the callback notification if a NULL buffer is returned, avoiding unnecessary polling. The new callback improves efficiency by enabling event-driven behavior for buffer management, reducing CPU overhead while maintaining responsiveness. Signed-off-by: Pavel Vasilyev <[email protected]>

This commit adds a unit test that checks the freed buffer callback of the bluetooth data buffer API. Signed-off-by: Pavel Vasilyev <[email protected]>

PavelVPV · 2024-12-09T20:12:31Z

@PavelVPV there's a conflict that needs resolving - please rebase.

Done

LuoZhongYao · 2024-12-24T06:46:51Z

Because enum bt_buf_type is now a bitmap, and struct bt_buf_data.type is uint8_t, and in the foreseeable future, bt_buf_type will increase, should we consider changing struct bt_buf_data.type to uint32_t?

zephyrbot added area: Bluetooth Host Bluetooth Host (excluding BR/EDR) area: Bluetooth area: Bluetooth ISO Bluetooth LE Isochronous Channels labels Nov 20, 2024

zephyrbot requested review from alwa-nordic, hermabe, jhedberg, kruithofa, rugeGerritsen, sjanc, Thalley and theob-pro November 20, 2024 09:35

zephyrbot assigned jhedberg and alwa-nordic Nov 20, 2024

PavelVPV mentioned this pull request Nov 20, 2024

bluetooth: hci_driver: Fix deadlock in MPSL workq nrfconnect/sdk-nrf#18953

Merged

PavelVPV force-pushed the hci_driver_async_buf_get_upstream branch 2 times, most recently from 4dc4ff3 to f9e1bef Compare November 20, 2024 12:17

benediktibk mentioned this pull request Nov 20, 2024

build of sample psa/its fails #81639

Closed

PavelVPV force-pushed the hci_driver_async_buf_get_upstream branch from f9e1bef to a41ae5b Compare November 21, 2024 07:09

PavelVPV force-pushed the hci_driver_async_buf_get_upstream branch 2 times, most recently from b02b581 to e203e24 Compare November 22, 2024 08:21

jhedberg previously approved these changes Nov 22, 2024

View reviewed changes

alwa-nordic reviewed Nov 26, 2024

View reviewed changes

tests/bluetooth/buf/src/main.c Outdated Show resolved Hide resolved

alwa-nordic reviewed Nov 26, 2024

View reviewed changes

PavelVPV dismissed jhedberg’s stale review via 88ca011 November 26, 2024 15:07

PavelVPV force-pushed the hci_driver_async_buf_get_upstream branch from e930b25 to 88ca011 Compare November 26, 2024 15:07

Thalley removed their request for review November 26, 2024 16:14

PavelVPV force-pushed the hci_driver_async_buf_get_upstream branch from 88ca011 to e01cbc8 Compare December 9, 2024 13:13

PavelVPV requested review from jhedberg and alwa-nordic December 9, 2024 13:13

zephyrbot requested a review from cvinayak December 9, 2024 13:14

alwa-nordic previously approved these changes Dec 9, 2024

View reviewed changes

theob-pro previously approved these changes Dec 9, 2024

View reviewed changes

PavelVPV added 3 commits December 9, 2024 21:03

bluetooth: buf: Convert bt_buf_type enum to bitmask

91ed10f

This allows to combine several types in a single value. Signed-off-by: Pavel Vasilyev <[email protected]>

tests: bluetooth: buf: Test the freed buf callback

2e32342

This commit adds a unit test that checks the freed buffer callback of the bluetooth data buffer API. Signed-off-by: Pavel Vasilyev <[email protected]>

PavelVPV dismissed stale reviews from theob-pro and alwa-nordic via 2e32342 December 9, 2024 20:11

PavelVPV force-pushed the hci_driver_async_buf_get_upstream branch from e01cbc8 to 2e32342 Compare December 9, 2024 20:11

PavelVPV requested review from alwa-nordic and theob-pro December 9, 2024 20:12

jhedberg approved these changes Dec 9, 2024

View reviewed changes

theob-pro approved these changes Dec 10, 2024

View reviewed changes

kartben merged commit 0d06691 into zephyrproject-rtos:main Dec 10, 2024
27 checks passed

This was referenced Dec 10, 2024

[nrf fromtree] bluetooth: host: downstream bluetooth/buf.h API change nrfconnect/sdk-zephyr#2353

Merged

bluetooth: host: downstream bluetooth/buf.h API change nrfconnect/sdk-nrf#19412

Merged

ubieda mentioned this pull request Dec 31, 2024

Bluetooth: API to signal host-managed buffers are freed #77249

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bluetooth: buf: Add a callback for freed buffer in rx pool #81646

bluetooth: buf: Add a callback for freed buffer in rx pool #81646

PavelVPV commented Nov 20, 2024

alwa-nordic commented Nov 20, 2024 •

edited

Loading

alwa-nordic commented Nov 20, 2024 •

edited

Loading

rugeGerritsen commented Nov 20, 2024

alwa-nordic commented Nov 20, 2024 •

edited

Loading

alwa-nordic commented Nov 20, 2024

PavelVPV commented Nov 21, 2024

PavelVPV commented Nov 21, 2024

PavelVPV commented Nov 21, 2024

alwa-nordic commented Nov 21, 2024

alwa-nordic Nov 26, 2024

alwa-nordic Nov 26, 2024

PavelVPV Nov 26, 2024

alwa-nordic Nov 27, 2024 •

edited

Loading

PavelVPV Nov 27, 2024

alwa-nordic Nov 26, 2024

PavelVPV Nov 26, 2024

jhedberg commented Dec 9, 2024

PavelVPV commented Dec 9, 2024

LuoZhongYao commented Dec 24, 2024

		zassert_equal(test_vector[i].exp_type_mask, freed_buf_type,
		"Unexpected buffer type");

bluetooth: buf: Add a callback for freed buffer in rx pool #81646

bluetooth: buf: Add a callback for freed buffer in rx pool #81646

Conversation

PavelVPV commented Nov 20, 2024

alwa-nordic commented Nov 20, 2024 • edited Loading

alwa-nordic commented Nov 20, 2024 • edited Loading

rugeGerritsen commented Nov 20, 2024

alwa-nordic commented Nov 20, 2024 • edited Loading

alwa-nordic commented Nov 20, 2024

PavelVPV commented Nov 21, 2024

PavelVPV commented Nov 21, 2024

PavelVPV commented Nov 21, 2024

alwa-nordic commented Nov 21, 2024

alwa-nordic Nov 26, 2024

Choose a reason for hiding this comment

alwa-nordic Nov 26, 2024

Choose a reason for hiding this comment

PavelVPV Nov 26, 2024

Choose a reason for hiding this comment

alwa-nordic Nov 27, 2024 • edited Loading

Choose a reason for hiding this comment

PavelVPV Nov 27, 2024

Choose a reason for hiding this comment

alwa-nordic Nov 26, 2024

Choose a reason for hiding this comment

PavelVPV Nov 26, 2024

Choose a reason for hiding this comment

jhedberg commented Dec 9, 2024

PavelVPV commented Dec 9, 2024

LuoZhongYao commented Dec 24, 2024

alwa-nordic commented Nov 20, 2024 •

edited

Loading

alwa-nordic commented Nov 20, 2024 •

edited

Loading

alwa-nordic commented Nov 20, 2024 •

edited

Loading

alwa-nordic Nov 27, 2024 •

edited

Loading