Add new tensor diposal algorithm for sync graph execution #7392

chunnienc · 2023-02-17T18:35:35Z

See code comments for more details.
It only works for sync execution since the new algorithm depends on a fixed node execution order. The time complexity decreases from O(NumEdges*NumTensors) or more precise O(NumTensorEdges) to O(NumTensors).
The time saved is around 5% for MobileNetV3 (again, tested on my laptop and may be imprecise).

To see the logs from the Cloud Build CI, please join either our discussion or announcement mailing list.

This change is

tfjs-converter/src/executor/graph_executor.ts

mattsoulanille · 2023-02-18T00:00:19Z

tfjs-converter/src/executor/graph_executor.ts

+  private checkTensorForDisposalWithNodeLiveUntilInfo(
+      node: Node, tensorMap: NamedTensorsMap, tensorsToKeep: Set<number>,
+      outputNodeNameSet: Set<string>, liveUntilNodes?: Node[]) {
+    // Skip output nodes and any control flow nodes, since its dependency is
+    // tricky to track correctly.
+    if (isControlFlow(node) || outputNodeNameSet.has(node.name)) {
+      return;
+    }
+    if (liveUntilNodes == null) {
+      return;
+    }
+
+    for (const node of liveUntilNodes) {
+      for (const tensor of tensorMap[node.name]) {
+        if (!tensor || tensor.kept || tensorsToKeep.has(tensor.id)) {
+          continue;
+        }
+        tensor.dispose();
+      }
+    }


This function is only called in one place, and it's essentially checking some properties of node and then deleting everything in liveUntilNodes. While liveUntilNodes comes from a map of nodes with a specific property about when and why its nodes may be disposed, this function's implementation doesn't use that property (since it's all done in the setup of the nodeLiveUntilMap). Maybe we can rename this function to be more generic, like disposeNodes and only pass in a list of nodes to dispose. We could also factor out this snippet from here and the original checkNodesForDisposal.

if (isControlFlow(node) || outputNodeNameSet.has(node.name)) { return; }

What do you think?

I intentionally make this function keep similar signature and name as the original checkNodesForDisposal, to maintain the parity of tensor disposing functions in sync and async graph executor. I want to make it easier for future maintainer to understand that they are the same, at least from the aspect of intentions and behaviors.

I think a better rewrite may be renaming both of them to something like "function disposeNodes...(...)", since the original function not only "checks" the tensors but also disposes tensors in it. What do you think?

For factoring out some stuff, I don't see huge gains on making a separate function for that single if check. I'm thinking to rewrite the async disposing algorithm to utilize the node dependency and reduce loops over tensor edges, but I haven't figured out a better algo (and I need to learn more about async control flow to make sure the new algo won't break it). Maybe we can have a better picture of which parts can be shared then.

tfjs-converter/src/executor/model_analysis.ts

mattsoulanille · 2023-02-18T02:02:35Z

tfjs-converter/src/operations/executors/utils.ts

-export function parseNodeName(name: string): [string, number, string] {
+export function parseNodeName(
+    name: string, context?: ExecutionContext): [string, number, string?] {
+  if (name === '') {
+    return ['', 0, undefined];
+  }
+
+  const isCacheEnabled = context != null && context.parseNodeNameCache != null;
+  if (isCacheEnabled) {
+    const cachedResult = context.parseNodeNameCache.get(name);
+    if (cachedResult != null) {
+      return cachedResult;
+    }
+  }
  const parts = name.split(':');
+  let result: [string, number, string?];
  if (parts.length === 1) {
-    return [name, 0, undefined];
+    result = [name, 0, undefined];


This is from #7391, so that PR should be merged before this one.

tfjs-converter/src/executor/model_analysis.ts

Co-authored-by: Matthew Soulanille <[email protected]>

tfjs-converter/src/executor/model_analysis.ts

Co-authored-by: Matthew Soulanille <[email protected]>

…osal

Linchenn

LGTM

chunnienc added 11 commits February 16, 2023 23:25

Add parseNodeName cache

6da8989

fix

467f9f0

Fix getParamValue

2a796da

Refactor check tensor disposal function

2c1e3f2

Add new tensor disposal algorithm

5f3c0d4

Fix constant spelling error

eb544a3

Use set for string existence check

edd53d5

Move getNodeLiveUntilMap to mode_analysis.ts

0477066

Fix lint

68ac9cb

Merge branch 'graph-exec-perf' into graph-tensor-disposal

7ef1acb

fix lint

5a752b7

chunnienc requested review from mattsoulanille and Linchenn February 17, 2023 18:36

chunnienc marked this pull request as ready for review February 17, 2023 18:37

mattsoulanille reviewed Feb 18, 2023

View reviewed changes

chunnienc and others added 6 commits February 17, 2023 21:09

Update tfjs-converter/src/executor/model_analysis.ts

62b312a

Co-authored-by: Matthew Soulanille <[email protected]>

Update tfjs-converter/src/executor/model_analysis.ts

3d26ca6

Co-authored-by: Matthew Soulanille <[email protected]>

Update tfjs-converter/src/executor/model_analysis.ts

052e750

Co-authored-by: Matthew Soulanille <[email protected]>

Update tfjs-converter/src/executor/graph_executor.ts

fe84f48

Co-authored-by: Matthew Soulanille <[email protected]>

Refactor based on comments

012ccd7

Rewrite liveUntil array comment

5c587d9

mattsoulanille reviewed Feb 18, 2023

View reviewed changes

tfjs-converter/src/executor/model_analysis.ts Outdated Show resolved Hide resolved

chunnienc and others added 2 commits February 18, 2023 09:11

Update tfjs-converter/src/executor/model_analysis.ts

810662f

Co-authored-by: Matthew Soulanille <[email protected]>

Rewrite liveUntil builder

919e10b

mattsoulanille approved these changes Feb 21, 2023

View reviewed changes

Merge remote-tracking branch 'upstream/master' into graph-tensor-disp…

f0acdbe

…osal

Linchenn approved these changes Feb 22, 2023

View reviewed changes

Merge branch 'master' into graph-tensor-disposal

e46bf72

chunnienc merged commit 8c460d9 into tensorflow:master Feb 22, 2023

chunnienc deleted the graph-tensor-disposal branch February 22, 2023 22:38

axinging mentioned this pull request Mar 3, 2023

Tensor is disposed too early for FaceLandmarkDetection&architecture=attention_mesh #7445

Closed

vladmandic mentioned this pull request Mar 21, 2023

regression: tensor is disposed early #7504

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new tensor diposal algorithm for sync graph execution #7392

Add new tensor diposal algorithm for sync graph execution #7392

chunnienc commented Feb 17, 2023 •

edited

Loading

mattsoulanille Feb 18, 2023

chunnienc Feb 18, 2023

mattsoulanille Feb 18, 2023

Linchenn left a comment

Add new tensor diposal algorithm for sync graph execution #7392

Add new tensor diposal algorithm for sync graph execution #7392

Conversation

chunnienc commented Feb 17, 2023 • edited Loading

mattsoulanille Feb 18, 2023

Choose a reason for hiding this comment

chunnienc Feb 18, 2023

Choose a reason for hiding this comment

mattsoulanille Feb 18, 2023

Choose a reason for hiding this comment

Linchenn left a comment

Choose a reason for hiding this comment

chunnienc commented Feb 17, 2023 •

edited

Loading