[type] [bug] Fix global load of CustomFloatType on CUDA #2115

Hanke98 · 2020-12-22T12:50:54Z

Related issue = #1905 #2065

Some mistakes were made in global loading on CUDA, which would lead to compilation error. I will point out the bug below.

And also, I added a new test case focusing on this problem in this pr

taichi/backends/cuda/codegen_cuda.cpp

Hanke98 · 2020-12-22T12:59:48Z

tests/python/test_custom_float.py

+    @ti.kernel
+    def test(data: ti.f32):
+        ti.cache_read_only(x)
+        assert x[None] == data


Force the x cache read only, to test global loading in codegen_cuda.cpp. Or, the x might be read the function in codegen_llvm.cpp

yuanming-hu

LGTM! Thanks.

taichi/backends/cuda/codegen_cuda.cpp

Co-authored-by: Yuanming Hu <[email protected]>

Hanke98 commented Dec 22, 2020

View reviewed changes

Hanke98 requested review from yuanming-hu, taichi-gardener and TH3CHARLie December 22, 2020 12:59

yuanming-hu approved these changes Dec 22, 2020

View reviewed changes

taichi/backends/cuda/codegen_cuda.cpp Show resolved Hide resolved

Hanke98 and others added 4 commits December 23, 2020 09:40

fix

508356a

add test case

98debfa

[skip ci] enforce code format

e96322f

add comments

f771b3b

Co-authored-by: Yuanming Hu <[email protected]>

Hanke98 force-pushed the debug-cft-global-load branch from 8277c6f to f771b3b Compare December 23, 2020 01:44

taichi-gardener and others added 2 commits December 22, 2020 20:45

[skip ci] enforce code format

372cd93

trigger CI

05aebf3

Hanke98 merged commit 111ad8c into taichi-dev:master Dec 23, 2020

k-ye mentioned this pull request Jan 5, 2021

[release] v0.7.12 #2144

Merged

Provide feedback