[async] More SFG optimizations #1877

yuanming-hu · 2020-09-16T15:22:25Z

Related issue = #742

List of changes:

Added misc/async_mgpcg.py, which is a fully asynchronous MGPCG. Note that the old MGPCG has Python-scope operations such as beta = sum[None], which force the system to synchronize
Activating global pointers now changes all the mask states from the SNode to the root
StateFlowGraph::optimize_listgen is re-written. Now the ClearListStmt gets removed together with the listgen task
StateFlowGraph::optimize_dead_store no longer removes ClearListStmt
AsyncEngine::synchronize() no longer runs optimize_listgen after fusion, since ListGen with a fused series of ClearList is not yet supported (TODO: support...)
Every TaskLaunchRecord now has a unique id. This will be used when generating the dot files. Having a consistent id simplifies debugging. StateFlowGraph::Node::launch_id is removed and we display the TaskLaunchRecord id only
Program now has a snodes mapping that maps SNode id to SNode pointer
StateFlowGraph::extract now takes a bool sort. Note that sorting an intermediate graph (which is not order-independent) can lead to wrong results
Added StateFlowGraph::demote_activation
Added a ConstExprPropagation for demote_activation
StateFlowGraph::dump_dot can now visualize the state flow chain of a single state only. (TODO: make a standard API? It's hard-coded for now. What should the API look like?)
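
The first item in the list above is the reason for the new example: a read from Python scope, such as beta = sum[None], has to wait for every queued kernel to finish. The toy model below sketches that effect in plain Python with no Taichi dependency. `ToyAsyncEngine`, `launch`, `read`, and the kernel functions are hypothetical names made up for illustration. They are not Taichi APIs, and the kernel bodies are placeholders rather than real MGPCG steps.

```python
class ToyAsyncEngine:
    """Hypothetical stand-in for an asynchronous kernel launcher (not Taichi's API)."""

    def __init__(self):
        self.pending = []    # kernels queued but not yet executed
        self.fields = {}     # scalar "fields" owned by the engine
        self.sync_count = 0  # number of times a non-empty queue was flushed

    def launch(self, kernel):
        # Launching only enqueues the kernel; nothing runs yet.
        self.pending.append(kernel)

    def synchronize(self):
        # Run everything queued so far, in launch order.
        if self.pending:
            self.sync_count += 1
        for kernel in self.pending:
            kernel(self.fields)
        self.pending.clear()

    def read(self, name):
        # A Python-scope read must observe finished results,
        # so it forces a flush first.
        self.synchronize()
        return self.fields[name]


def reduce_kernel(fields):
    fields["sum"] = fields.get("sum", 0.0) + 1.0


def make_update_kernel(beta):
    # beta was read in Python scope and is baked into the kernel.
    def update(fields):
        fields["x"] = fields.get("x", 0.0) + beta
    return update


def update_from_field(fields):
    # Reads "sum" inside the kernel instead of from Python scope.
    fields["x"] = fields.get("x", 0.0) + fields["sum"]


def run_with_python_scope_reads(iters):
    engine = ToyAsyncEngine()
    for _ in range(iters):
        engine.launch(reduce_kernel)
        beta = engine.read("sum")  # forces a flush every iteration
        engine.launch(make_update_kernel(beta))
    engine.synchronize()
    return engine


def run_fully_async(iters):
    engine = ToyAsyncEngine()
    for _ in range(iters):
        engine.launch(reduce_kernel)
        engine.launch(update_from_field)  # no Python-scope read
    engine.synchronize()  # single flush at the end
    return engine
```

Both variants compute the same value for "x", but the first flushes the queue once per iteration plus once at the end, while the second flushes only once. In this toy model that single long queue is what gives a graph optimizer a whole sequence of launches to work on.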

codecov · 2020-09-16T16:14:38Z

Codecov Report

Merging #1877 into master will increase coverage by 0.95%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #1877      +/-   ##
==========================================
+ Coverage   43.25%   44.21%   +0.95%     
==========================================
  Files          44       44              
  Lines        6320     6111     -209     
  Branches     1092     1092              
==========================================
- Hits         2734     2702      -32     
+ Misses       3416     3240     -176     
+ Partials      170      169       -1

Impacted Files	Coverage Δ
python/taichi/lang/ast_checker.py	`70.58% <0.00%> (-1.64%)`	⬇️
python/taichi/testing.py	`75.00% <0.00%> (-0.72%)`	⬇️
python/taichi/lang/linalg.py	`89.33% <0.00%> (-0.67%)`	⬇️
python/taichi/lang/meta.py	`62.31% <0.00%> (-0.54%)`	⬇️
python/taichi/lang/__init__.py	`41.94% <0.00%> (-0.51%)`	⬇️
python/taichi/misc/util.py	`17.48% <0.00%> (-0.26%)`	⬇️
python/taichi/misc/task.py	`0.00% <0.00%> (ø)`
python/taichi/lang/shell.py	`0.00% <0.00%> (ø)`
python/taichi/tools/patterns.py	`0.00% <0.00%> (ø)`
python/taichi/lang/kernel.py	`71.17% <0.00%> (+0.13%)`	⬆️
... and 9 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 17c9fb7...9e5cfb9. Read the comment docs.

taichi/program/state_flow_graph.cpp

xumingkuan

Great!! LGTMig.

misc/visualize_state_flow_graph.py

taichi/program/state_flow_graph.cpp

k-ye

Sorry for the delay. I'm still catching up on this part, so please feel free to merge! I left a question on the activation demotion.

k-ye · 2020-09-19T03:26:11Z

taichi/program/state_flow_graph.cpp

+      continue;
+
+    auto list_node = *node->input_edges[list_state].begin();
+    tasks[std::make_pair(node->rec.ir_handle, list_node)].push_back(node);


Just to confirm, if there's such a launch sequence:

    @ti.kernel
    def foo():
        for i in x:
            y[i] = ...

    @ti.kernel
    def bar():
        for i in range(...):
            if some_cond(i):
                x[i] = ...  # potentially changes x's mask/list state

    foo()
    bar()
    foo()  # <-- cannot demote the activation of |y| now

Does this not demote activation as expected?

yuanming-hu · 2020-09-19

I believe things will go as expected (although not tested). The second foo() uses a different list of x, and activation demotion only happens when the same list states are used :-)

(Btw we should set up a testing system so that cases like this can be tested...)
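
The grouping key in the diff above, (ir_handle, list_node), can be illustrated with a small stand-alone model of the foo() / bar() / foo() sequence. This is only a sketch under simplifying assumptions: the tuple layout of a launch record, the function names, and the use of -1 for the initial list state are all hypothetical, and the real pass in state_flow_graph.cpp may apply further conditions before demoting anything.

```python
from collections import defaultdict


def group_by_kernel_and_list(launches):
    """Group launch indices by (kernel name, producer of the list state read).

    `launches` is a list of (kernel_name, reads_list_of, writes_list_of)
    tuples. This record layout is a hypothetical simplification.
    """
    list_producer = {}  # SNode name -> index of the launch that last changed its list
    groups = defaultdict(list)
    for idx, (kernel, reads, writes) in enumerate(launches):
        if reads is not None:
            # -1 stands for the initial list state, before any launch touched it.
            producer = list_producer.get(reads, -1)
            groups[(kernel, producer)].append(idx)
        if writes is not None:
            list_producer[writes] = idx
    return dict(groups)


def demotion_candidates(launches):
    # Only a repeated launch of the same kernel over the *same* list state
    # is treated as a candidate; the first launch in a group never is.
    groups = group_by_kernel_and_list(launches)
    return sorted(i for members in groups.values() for i in members[1:])


# foo iterates over x's list; bar may activate elements of x.
with_bar = [("foo", "x", None), ("bar", None, "x"), ("foo", "x", None)]
without_bar = [("foo", "x", None), ("foo", "x", None)]
```

With bar() in between, the two foo() launches read list states produced by different launches, so they land in different groups and neither is a candidate. Without bar(), both fall into one group and only the second launch is a candidate.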

yuanming-hu marked this pull request as draft September 16, 2020 15:22

yuanming-hu added 25 commits September 16, 2020 19:18

update mgpcg

ebdff1f

reproduce random outputs

3aeeeb9

reproduce empty lists

71d72d5

fix activating pointer meta data

0cadf76

remove sync in mgpcg

fd1694a

reproduce ti.sync removal leads to wrong results

963dd00

speed up optimize listgen

7ff62e3

reproduce empty lists

9edab19

reproduce fill tensor listgen 0

75253f5

reproduce fill tensor listgen 0

b3902ea

mgpcg2

cb5c4e7

simplify mgpcg2

53b5251

simplify mgpcg2

5cf981a

multires.py

021defd

multires.py

fddc0dc

reproduce bugs due to node delection

cd462b4

visualize state flow chain

509691f

rec::id

2713073

multi iteration MGPCG works with listgen

4f6fc9f

clean up

d368533

format

300f42c

10 MGPCG iters work (22K node)

faa39ec

improve dot visualization

0c72d3f

test demotion with listgen

3aaf484

demotion

fb77de7

yuanming-hu force-pushed the asyncmgpcg branch from 04cddb4 to fb77de7 September 16, 2020 23:35

yuanming-hu added 2 commits September 16, 2020 19:46

node dot include hash

e2689ea

act demotion working on MGPCG together with listgen

366709d

yuanming-hu added 5 commits September 16, 2020 20:01

faster activation demotion

d345ede

no iterative opt for now

f619bd1

clean up

58f8cbc

clean up

556de9f

finalize

9b098a8

yuanming-hu marked this pull request as ready for review September 18, 2020 00:30

finalize

7cb76dc

yuanming-hu commented Sep 18, 2020

taichi/program/state_flow_graph.cpp (resolved)

yuanming-hu requested review from xumingkuan, k-ye and taichi-gardener September 18, 2020 02:11

xumingkuan approved these changes Sep 18, 2020

misc/visualize_state_flow_graph.py (resolved)

taichi/program/state_flow_graph.cpp (outdated, resolved)

k-ye approved these changes Sep 19, 2020

yuanming-hu added 2 commits September 19, 2020 00:49

apply review

3260e4f

.

9e5cfb9

yuanming-hu merged commit 1b78447 into taichi-dev:master Sep 19, 2020

yuanming-hu mentioned this pull request Sep 19, 2020

[release] v0.6.36 #1882

Merged
