
fix: EXC-1838 Run hook after CanisterWasmMemoryLimitExceeded error is fixed #3631

Conversation

@dragoljub-duric dragoljub-duric commented Jan 27, 2025

Problem:
As previously observed by @berestovskyy in #3455 (comment), it may happen that execution of the low_wasm_memory hook is stopped when wasm_memory_limit < used_wasm_memory.
Solution:
If that happens, run the hook after the error is fixed if the hook condition remains satisfied.

@dragoljub-duric dragoljub-duric changed the title Exc 1838 revisit hook status behavior when hook execution is stopped because wasm memory usage wasm memory limit fix: EXC-1838 Run hook after CanisterWasmMemoryLimitExceeded error is fixed Feb 3, 2025
@github-actions github-actions bot added the fix label Feb 3, 2025
@dragoljub-duric dragoljub-duric marked this pull request as ready for review February 3, 2025 13:02
@dragoljub-duric dragoljub-duric requested a review from a team as a code owner February 3, 2025 13:02

@berestovskyy berestovskyy left a comment

LGTM, thanks!

@dragoljub-duric dragoljub-duric added this pull request to the merge queue Feb 3, 2025
Merged via the queue into master with commit 773b035 Feb 3, 2025
25 checks passed
@dragoljub-duric dragoljub-duric deleted the EXC-1838-revisit-hook-status-behavior-when-hook-execution-is-stopped-because-wasm-memory-usage-wasm-memory-limit branch February 3, 2025 14:35
Comment on lines +150 to +151
if err.code() == ErrorCode::CanisterWasmMemoryLimitExceeded
&& original.call_or_task == CanisterCallOrTask::Task(CanisterTask::OnLowWasmMemory)
Contributor

@dragoljub-duric Wouldn't it be better to not perform any execution in this case (instead of spending cycles on an execution that fails)? Isn't it possible to check the limits in advance before running the execution?

Contributor Author

UpdateHelper::new immediately checks the limit, in this file on line 371 below. So we could move this check into UpdateHelper::new, but that would require refactoring UpdateHelper::new because, in the case of the error, it would need to return a modified state (the state where we put the hook back on the task queue). Does that answer your question?
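
A minimal, self-contained sketch of the idea in this thread, using simplified stand-in types (the enums, the Vec task queue, and maybe_requeue_hook below are assumptions for illustration, not the actual replica API): when the low-Wasm-memory hook itself fails with CanisterWasmMemoryLimitExceeded, the hook is put back on the task queue so it can run once the limit error is fixed.

```rust
// Sketch only: these are simplified stand-ins, not the actual replica types.
#[derive(Debug, PartialEq)]
enum ErrorCode {
    CanisterWasmMemoryLimitExceeded,
}

#[derive(Debug, PartialEq)]
enum CanisterTask {
    OnLowWasmMemory,
}

#[derive(Debug, PartialEq)]
enum CanisterCallOrTask {
    Task(CanisterTask),
}

#[derive(Debug, PartialEq)]
enum ExecutionTask {
    OnLowWasmMemory,
}

/// If the low-Wasm-memory hook failed because used Wasm memory already
/// exceeds wasm_memory_limit, put the hook back on the task queue so it
/// runs again once the limit error is fixed (provided the hook condition
/// is still satisfied at that point).
fn maybe_requeue_hook(
    err_code: &ErrorCode,
    call_or_task: &CanisterCallOrTask,
    task_queue: &mut Vec<ExecutionTask>,
) {
    if *err_code == ErrorCode::CanisterWasmMemoryLimitExceeded
        && *call_or_task == CanisterCallOrTask::Task(CanisterTask::OnLowWasmMemory)
    {
        task_queue.push(ExecutionTask::OnLowWasmMemory);
    }
}

fn main() {
    let mut queue = Vec::new();
    maybe_requeue_hook(
        &ErrorCode::CanisterWasmMemoryLimitExceeded,
        &CanisterCallOrTask::Task(CanisterTask::OnLowWasmMemory),
        &mut queue,
    );
    assert_eq!(queue, vec![ExecutionTask::OnLowWasmMemory]);
}
```

In the real code, this handling sits in the execution error path shown in the diff above, and per the PR description the re-queued hook only runs if the hook condition remains satisfied.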

Contributor

> Does that answer your question?

Not really, because it is not clear to me (without looking into code that I'm not super familiar with) at what point in time the failure happens and whether cycles are charged. Could you please clarify that a bit more?

Contributor Author

Context: we concluded that we're charging the base fee of 5M cycles per execution nonetheless, as update_message_execution_fee in prepay_execution_cycles, which is not refunded in this case.

Contributor Author

The quick fix I see is that in this case, we can refund an additional 5M. @mraszyk what do you think?

Contributor

> The quick fix I see is that in this case

As a quick fix, it makes sense, but I wonder if the code doesn't become fragile due to such a fix. Do you see a way to avoid the refund by not preparing the execution (prepaying etc.) at all?

Contributor Author

I think it is doable; I am trying to add a check in execute_call_or_task before calling prepay_execution_cycles.

Contributor

This check could then ideally also apply to global timer etc.

Contributor Author

Yes, it will apply to all updates/tasks.
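
A rough sketch of that follow-up idea, with simplified stand-in types and field names (Canister, ExecError, and check_wasm_memory_limit below are assumptions for illustration, not the replica's actual execute_call_or_task signature): the limit is checked before prepay_execution_cycles, so a doomed execution is rejected before any base fee is charged, uniformly for updates and tasks.

```rust
// Sketch only: simplified stand-ins for the replica's canister state and
// error types; the real execute_call_or_task has a much richer signature.
struct Canister {
    used_wasm_memory: u64,
    wasm_memory_limit: u64,
}

#[derive(Debug, PartialEq)]
enum ExecError {
    WasmMemoryLimitExceeded,
}

/// Early check applied uniformly to updates and tasks (calls, heartbeat,
/// global timer, low-Wasm-memory hook) before any cycles are prepaid.
fn check_wasm_memory_limit(canister: &Canister) -> Result<(), ExecError> {
    if canister.used_wasm_memory > canister.wasm_memory_limit {
        return Err(ExecError::WasmMemoryLimitExceeded);
    }
    Ok(())
}

fn execute_call_or_task(canister: &Canister) -> Result<(), ExecError> {
    // Reject up front, before prepaying the base execution fee ...
    check_wasm_memory_limit(canister)?;
    // ... then prepay cycles and run the execution (elided in this sketch).
    Ok(())
}

fn main() {
    let over_limit = Canister {
        used_wasm_memory: 6_000_000,
        wasm_memory_limit: 5_000_000,
    };
    assert_eq!(
        execute_call_or_task(&over_limit),
        Err(ExecError::WasmMemoryLimitExceeded)
    );
}
```

Checking before prepaying sidesteps the refund question entirely, which addresses the concern above that a refund-based quick fix could make the code fragile.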

@@ -341,26 +359,22 @@ impl UpdateHelper {

validate_message(&canister, &original.method)?;

if let CanisterCallOrTask::Call(_) = original.call_or_task {
// TODO(RUN-957): Enforce the limit in heartbeat and timer after
Contributor

There's still one more TODO(RUN-957) in the code to be resolved. CC @dragoljub-duric

Contributor

@mraszyk mraszyk Feb 6, 2025

But in my opinion, it seems safer not to enforce the limit during a system task, i.e., to simply drop the other TODO(RUN-957), instead of trapping during a system task.

@@ -341,26 +359,22 @@ impl UpdateHelper {

validate_message(&canister, &original.method)?;

if let CanisterCallOrTask::Call(_) = original.call_or_task {
Contributor

Before this PR, we wouldn't be enforcing the limit for system tasks here, so I'm not sure why this PR is needed at all; the current effect of this PR seems to be as follows:

  • global timers and heartbeats fail if the wasm memory limit is exceeded initially (although they succeed if the wasm memory limit is exceeded during their execution): this behavior seems surprising to me
  • low on wasm memory hooks are retried if the wasm memory limit is exceeded initially (although they succeed if the wasm memory limit is exceeded during their execution): this behavior might be undesirable since the hook might be crucial in resolving the exceeded wasm memory limit and it wouldn't run due to this PR.

Contributor Author

@dragoljub-duric dragoljub-duric Feb 6, 2025

> global timers and heartbeats fail if the wasm memory limit is exceeded initially (although they succeed if the wasm memory limit is exceeded during their execution): this behavior seems surprising to me

This sounds expected to me, and it will behave the same way as in the update case. In my opinion, having homogeneous behavior for tasks/updates is a plus.

> low on wasm memory hooks are retried if the wasm memory limit is exceeded initially (although they succeed if the wasm memory limit is exceeded during their execution): this behavior might be undesirable since the hook might be crucial in resolving the exceeded wasm memory limit and it wouldn't run due to this PR.

I can see the point in this one; maybe you are right. If the developer uses the hook to get notified that memory is below the threshold, having the hook stopped in this case may be unexpected.
