
[opengl] random use different seed in each launch #692

Merged: 3 commits into taichi-dev:master on Apr 2, 2020

Conversation

archibate
Collaborator

So that examples/sdf_renderer.py works.

Related issue = #492
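
For context, a minimal sketch of the symptom (the field name is illustrative): before this change, every kernel launch on the OpenGL backend started from the same seed, so repeated launches produced identical random values.

import taichi as ti
ti.init(arch=ti.opengl)
r = ti.field(ti.f32, shape=2)

@ti.kernel
def sample(i: ti.i32):
    r[i] = ti.random()

sample(0)
sample(1)
# Before this PR both launches reseed identically, so r[0] == r[1],
# and sdf_renderer.py accumulates the same sample every frame.
print(r[0], r[1])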


@yuanming-hu
Member

Thanks! I'll test it on my end. Meanwhile, let's finish #666 first so that we can get rid of the redundant commits.

@archibate archibate requested a review from yuanming-hu April 2, 2020 15:38
@archibate
Collaborator Author

We may also want to add a test case for this, to check that ti.random() doesn't return the same values when called again in a new launch, e.g.

import taichi as ti
ti.init()
x = ti.field(ti.f32, shape=5)

@ti.kernel
def func(i: ti.i32):
    x[i] = ti.random()

for i in range(5):
    func(i)
    if i > 0:  # avoid reading x[-1] on the first iteration
        assert x[i] != x[i - 1]

@yuanming-hu
Member

We may also want to add a test case for this, to check that ti.random() doesn't return the same values when called again in a new launch [...]

This might be a brittle test, since there's a tiny probability that x[i] == x[i - 1]. It would be better to, say, repeat 5 times and assert that at least 4 of the results are distinct.
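
A minimal sketch of such a test (the field shape and the threshold of 4 are illustrative):

import taichi as ti
ti.init()
x = ti.field(ti.f32, shape=5)

@ti.kernel
def fill(i: ti.i32):
    x[i] = ti.random()

for i in range(5):
    fill(i)

# Per-launch reseeding makes collisions extremely unlikely; tolerating
# a single collision keeps the test robust against rare false failures.
assert len({x[i] for i in range(5)}) >= 4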

[skip ci] fix access out of bound in test_random
@yuanming-hu
Member

[Screenshot from 2020-04-02 12-00-36]

I confirm this works on my end. It reaches 90 samples per pixel per second, which is 1.8x faster than the CUDA backend!

@archibate
Collaborator Author

archibate commented Apr 2, 2020

That's amazing! I can't believe we're now faster than CUDA! Does this mean the CUDA backend still has a lot of room for optimization, or is OpenGL simply better than CUDA?

@yuanming-hu
Member

I guess it's because the CUDA backend is not yet fully optimized (and the OpenGL backend is well done). I haven't had a chance to work on it yet.

@archibate
Collaborator Author

Seems #603 was just worrying over nothing... the performance of the current OpenGL backend is really being put to use now.
NEXT: support sparse data structures on GL, so that the thought-to-be-weak GLSL can truly stand on equal footing with CUDA :)

@archibate
Collaborator Author

#603 says:

Most of the time is spent on sync (data transfer to and from the GPU), thanks to GL's great API design; we may want to figure out how to use it more efficiently.

In fact, I found that sync is only costly with the Mesa driver, not with NVIDIA's. Maybe their GPUs really do have faster communication with the CPU?
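
As an aside, batching device-to-host transfers instead of reading elements one by one also reduces how often we sync; a rough sketch (the field name and shape are illustrative):

import taichi as ti
ti.init(arch=ti.opengl)
x = ti.field(ti.f32, shape=1000000)

@ti.kernel
def compute():
    for i in x:
        x[i] = ti.random()

compute()
# Each x[i] read from Python scope forces a device sync; copying the
# whole field once amortizes the transfer into a single sync.
data = x.to_numpy()
print(data[:5])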

@yuanming-hu
Member

In fact, I found that sync is only costly with the Mesa driver, not with NVIDIA's. Maybe their GPUs really do have faster communication with the CPU?

I guess it's because the NVIDIA driver is better implemented than Mesa for NVIDIA GPUs.

@archibate
Collaborator Author

Could you also test examples/nbody_oscillator.py on NVIDIA OpenGL and CUDA? I'd guess CUDA is better at memory-intensive kernels, isn't it?
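
In case anyone wants to reproduce the comparison, switching backends is just a matter of the arch passed to ti.init; everything else in the example stays the same:

import taichi as ti

# Select the backend under test; the rest of the script is unchanged.
ti.init(arch=ti.opengl)  # or: ti.init(arch=ti.cuda)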

@yuanming-hu
Member

I had to change N to 80000 to see a difference. OpenGL then reaches 15 FPS and CUDA 5 FPS, so OpenGL is 3x faster :-) It seems to me that there's a performance bug in the CUDA backend, which I should fix as soon as possible...

@yuanming-hu
Member

I updated https://github.com/taichi-dev/taichi/pull/692/files#diff-733c00d9e85a14e095b788b8b985e75c

Btw, do you want to fix the memory leak here? If not, I'll go ahead and merge this PR.

@archibate
Collaborator Author

It seems to me that there's a performance bug in the CUDA backend, which I should fix as soon as possible...

Sounds like a serious issue, good luck!

Btw, do you want to fix the memory leak here? If not, I'll go ahead and merge this PR.

No, let's just keep this PR small and trackable :)

@yuanming-hu
Member

Cool, thanks. Merging this in now...

@yuanming-hu yuanming-hu merged commit 7b36dda into taichi-dev:master Apr 2, 2020