
Add cache for parseNodeName #7391

Merged

chunnienc merged 5 commits from graph-exec-perf into tensorflow:master on Feb 22, 2023
Conversation

chunnienc
Collaborator

@chunnienc chunnienc commented Feb 17, 2023

This PR adds a cache for parseNodeName, an expensive string manipulation function in the graph executor. The time saved is around 2% to 3% for MobileNetV3 (tested on my laptop, so the numbers may be imprecise).
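For readers unfamiliar with the function, here is a minimal sketch of the idea (not the actual tfjs-converter code; the signature and cache wiring are illustrative): parseNodeName turns a tensor reference such as "MobilenetV3/Conv/BiasAdd:0" into a node name plus output index, and a Map keyed by the raw string lets repeated lookups skip the re-parsing.

```ts
// Illustrative sketch only, not the code in this PR.
// Uncached version: split "nodeName:outputIndex" into its parts.
function parseNodeNameUncached(name: string): [string, number] {
  const i = name.lastIndexOf(':');
  if (i === -1) {
    return [name, 0];  // No output index given; default to output 0.
  }
  return [name.substring(0, i), Number(name.substring(i + 1))];
}

// Cached version: the caller supplies a Map, so repeated references to the
// same tensor name hit the cache instead of re-running the string work.
function parseNodeName(
    name: string,
    cache?: Map<string, [string, number]>): [string, number] {
  const hit = cache?.get(name);
  if (hit != null) {
    return hit;
  }
  const parsed = parseNodeNameUncached(name);
  cache?.set(name, parsed);
  return parsed;
}
```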




@chunnienc chunnienc marked this pull request as ready for review February 17, 2023 18:15
Member

@mattsoulanille mattsoulanille left a comment

LGTM. I was going to recommend using a closure to memoize parseNodeName instead of adding it to the context, but it looks like you need to pass this context to other functions like getParamValue.
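As a sketch of the alternative mentioned here (purely hypothetical, with made-up names): a closure-based memoizer would keep the cache private to parseNodeName, which is exactly why it could not be shared with getParamValue or tied to a particular graph's lifetime.

```ts
// Hypothetical closure-based memoization, for comparison with the
// context-based cache in this PR. The cache lives for as long as the
// memoized function does, independent of any graph.
function memoize<R>(fn: (name: string) => R): (name: string) => R {
  const cache = new Map<string, R>();
  return (name: string) => {
    let value = cache.get(name);
    if (value === undefined) {
      value = fn(name);
      cache.set(name, value);
    }
    return value;
  };
}

// Example usage against the uncached parser sketched above:
// const parseNodeNameMemoized = memoize(parseNodeNameUncached);
```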

@chunnienc
Collaborator Author

chunnienc commented Feb 17, 2023

Yeah. I think the context is the right place since it covers calls from parseNodeName itself as well as from getTensor and getParamValue, and it ties the lifecycle of the cached strings to the graph, so these string keys can be GC'ed together with the graph.
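A sketch of what "the cache lives on the context" could look like (class and field names are illustrative, not the real ExecutionContext API): because the Map is owned by the per-graph context, getTensor and getParamValue share the same cache, and the cached strings become unreachable once the graph and its context are dropped.

```ts
// Illustrative only: a per-graph context that owns the parse cache.
class ExecutionContext {
  // Dropped together with the context, so cached keys are GC'ed with the graph.
  readonly nodeNameCache = new Map<string, [string, number]>();

  parseNodeName(name: string): [string, number] {
    let parsed = this.nodeNameCache.get(name);
    if (parsed == null) {
      const i = name.lastIndexOf(':');
      parsed = i === -1 ?
          [name, 0] :
          [name.substring(0, i), Number(name.substring(i + 1))];
      this.nodeNameCache.set(name, parsed);
    }
    return parsed;
  }
}

// Helpers like this one reuse the same context-owned cache.
function getTensor(name: string, tensors: Map<string, unknown[]>,
                   context: ExecutionContext): unknown {
  const [nodeName, index] = context.parseNodeName(name);
  return tensors.get(nodeName)?.[index];
}
```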

@Linchenn
Collaborator

Thanks! I think this approach would benefit small models a lot, since the cache lookup avoids repeated string operations.

Have you benchmarked some large models? For large models the number of nodes can be significant, so I am not sure whether, when the cache is very large, the cache lookup is still faster than the string operations.

@chunnienc
Collaborator Author

> Thanks! I think this approach would benefit small models a lot, since the cache lookup avoids repeated string operations.
>
> Have you benchmarked some large models? For large models the number of nodes can be significant, so I am not sure whether, when the cache is very large, the cache lookup is still faster than the string operations.

(Discussed offline) Yes, it's always faster.
Benchmark: https://jsbench.me/8eleeumdpi/1
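In the same spirit as the linked jsbench (the names and sizes below are made up, not the actual benchmark), the comparison is essentially a pre-populated Map lookup versus re-parsing the string every time:

```ts
// Rough sketch of the comparison; runs in any modern JS/TS environment.
const names = Array.from({length: 10000}, (_, i) => `node_${i}/output:0`);

const parse = (n: string): [string, number] => {
  const i = n.lastIndexOf(':');
  return i === -1 ? [n, 0] : [n.substring(0, i), Number(n.substring(i + 1))];
};

// Pre-populated cache, as the graph executor would have after a warm-up run.
const cache = new Map(names.map(n => [n, parse(n)] as const));

console.time('cache lookup');
for (const n of names) cache.get(n);
console.timeEnd('cache lookup');

console.time('string parse');
for (const n of names) parse(n);
console.timeEnd('string parse');
```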

Collaborator

@Linchenn Linchenn left a comment


LGTM, thanks!

@chunnienc chunnienc merged commit 7dfcc76 into tensorflow:master Feb 22, 2023
@chunnienc chunnienc deleted the graph-exec-perf branch February 22, 2023 06:07