[type] Local adder structure #2136

TH3CHARLie · 2021-01-01T16:10:55Z

Related issue = #1905

TH3CHARLie · 2021-01-05T19:26:38Z

some performance numbers comparing two kinds of loops:

[ 41.96%   0.105 s      1x |  105.085   105.085   105.085 ms] evolve_naive_c8_0_kernel_10_range_for
[  0.70%   0.002 s      1x |    1.756     1.756     1.756 ms] evolve_vectorized_c6_0_kernel_7_struct_for
[  0.51%   0.001 s      1x |    1.272     1.272     1.272 ms] evolve_vectorized_c6_0_kernel_6_listgen_S2dense
[  0.37%   0.001 s      1x |    0.926     0.926     0.926 ms] evolve_vectorized_c6_0_kernel_4_listgen_S1pointer
[  0.00%   0.000 s      1x |    0.001     0.001     0.001 ms] evolve_vectorized_c6_0_kernel_5_serial
[  0.00%   0.000 s      1x |    0.001     0.001     0.001 ms] evolve_vectorized_c6_0_kernel_3_serial

yuanming-hu

Awesome!! Just a few nits. Thanks!

yuanming-hu · 2021-01-05T21:16:28Z

taichi/ir/statements.h

+      : op_type(op_type),
+        lhs(lhs),
+        rhs(rhs),
+        is_bit_vectorized(is_bit_vectorized) {


This is a little too much intrusion into the existing system. It seems to me that is_bit_vectorized is only used in the bit_loop_vectorize pass - maybe you can use an std::unordered_map<Stmt *, bool> member variable in class BitLoopVectorize, instead of adding a new field in class BinaryOpStmt? (Just like llvm_val in LLVM codegens.)

is_bit_vectorized is used only in the bit_loop_ vectorize pass when tagged on BinaryOpStmt and this part should be replaced with some pass-scope data structure, just as you suggested. But for GlobalPtrStmt and GetChStmt, they need the tag to pass later passes including lower_access and type_check

Right, that's what I meant: we can use a pass-scope data structure just for BinaryOpStmt::is_bit_vectorized. Given we are rushing for the deadline it's fine that we don't do it now.

tests/python/test_bit_array_vectorization.py

yuanming-hu · 2021-01-05T21:38:59Z

taichi/transforms/bit_loop_vectorize.cpp

+            and_b_c->is_bit_vectorized = true;
+            // modify IR
+            auto and_a_b_p = and_a_b.get();
+            stmt->insert_before_me(std::move(load_a));


A more elegant way to do this:

taichi/taichi/transforms/demote_operations.cpp

Line 22 in b13abfa

VecStatement statements;

Use VecStatement::push_back<...> to create the statements and stmt->insert_before_me(vec_stmt) to insert. (You don't need the DelayedIRModifier part.)

I find to do it the elegant way we may still need DelayedIRModifier and there are other places that require changes as well(e.g. see the visitor for GlobalLoadStmt), therefore I think we should do it in a separate PR later and refactoring all this together. This also applies to the changes for BinaryOpStmt::is_bit_vectorized.

yuanming-hu · 2021-01-05T21:39:43Z

taichi/transforms/bit_loop_vectorize.cpp

+    stmt->insert_before_me(std::move(load_c));
+    stmt->insert_before_me(std::move(carry_c));
+    stmt->insert_before_me(std::move(sum_c));
+    stmt->insert_before_me(std::move(load_b));
+    stmt->insert_before_me(std::move(carry_b));
+    stmt->insert_before_me(std::move(sum_b));
+    stmt->insert_before_me(std::move(sum_a));


test all common 8 cases

6c9ad1d

TH3CHARLie marked this pull request as draft January 1, 2021 16:11

TH3CHARLie requested a review from taichi-gardener January 1, 2021 16:11

yuanming-hu changed the title ~~[type] Local Structure~~ [type] Local adder structure Jan 1, 2021

TH3CHARLie added 2 commits January 5, 2021 11:46

transform atomic add and clean up

e487acd

format

d7840f3

transform boolean expr and make test pass

9daee57

TH3CHARLie requested review from yuanming-hu and removed request for taichi-gardener January 5, 2021 19:31

TH3CHARLie marked this pull request as ready for review January 5, 2021 19:32

yuanming-hu approved these changes Jan 5, 2021

View reviewed changes

TH3CHARLie added 2 commits January 6, 2021 08:43

refine test case

9dcad62

inline function

6d5a002

TH3CHARLie merged commit 7df6f22 into taichi-dev:master Jan 6, 2021

k-ye mentioned this pull request Feb 4, 2021

[release] v0.7.13 #2180

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[type] Local adder structure #2136

[type] Local adder structure #2136

TH3CHARLie commented Jan 1, 2021

TH3CHARLie commented Jan 5, 2021

yuanming-hu left a comment

yuanming-hu Jan 5, 2021

TH3CHARLie Jan 6, 2021

yuanming-hu Jan 6, 2021

yuanming-hu Jan 5, 2021

TH3CHARLie Jan 6, 2021

yuanming-hu Jan 5, 2021

[type] Local adder structure #2136

[type] Local adder structure #2136

Conversation

TH3CHARLie commented Jan 1, 2021

TH3CHARLie commented Jan 5, 2021

yuanming-hu left a comment

Choose a reason for hiding this comment

yuanming-hu Jan 5, 2021

Choose a reason for hiding this comment

TH3CHARLie Jan 6, 2021

Choose a reason for hiding this comment

yuanming-hu Jan 6, 2021

Choose a reason for hiding this comment

yuanming-hu Jan 5, 2021

Choose a reason for hiding this comment

TH3CHARLie Jan 6, 2021

Choose a reason for hiding this comment

yuanming-hu Jan 5, 2021

Choose a reason for hiding this comment