
[OpenGL] Basic OpenGL backend #492 #495

Closed
wants to merge 53 commits

Conversation

archibate
Collaborator

Related issue id = #492

@archibate
Collaborator Author

archibate commented Feb 18, 2020

I couldn't wait to share more of the work with you!

@yuanming-hu yuanming-hu self-requested a review February 18, 2020 03:11
@yuanming-hu
Member

yuanming-hu commented Feb 18, 2020

Great! I have one small suggestion (which will make PR/code maintenance much easier): keep each PR small. This will also make your life easier since you have fewer conflicts and get quicker reviews. The PR process of the Metal backend #396 is a great example.

I'm super excited to witness the birth of a new backend! At the same time, we should also pay more attention to code maintainability, as we are getting more and more backends :-) Maybe we should consider extracting the common parts of the OpenGL and Metal (and potentially OpenCL in the future) codegen into a base class.
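A minimal sketch of that base-class idea (class and method names here are hypothetical, not Taichi's actual layout): the shared bookkeeping lives in a base class, and each backend overrides only the source-emission details.

```python
# Hedged sketch, not Taichi's actual design: shared codegen plumbing in a base
# class, with per-backend subclasses emitting backend-specific source.
class KernelCodegenBase:
    def __init__(self):
        self.lines = []

    def emit(self, line):
        self.lines.append(line)

    def source(self):
        return '\n'.join(self.lines)


class OpenGLCodegen(KernelCodegenBase):
    def emit_header(self):
        self.emit('#version 430 core')  # GLSL compute shader header


class MetalCodegen(KernelCodegenBase):
    def emit_header(self):
        self.emit('#include <metal_stdlib>')  # Metal shader header


gl = OpenGLCodegen()
gl.emit_header()
print(gl.source())  # #version 430 core
```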

@k-ye
Member

k-ye commented Feb 18, 2020

I can share some experiences when working on the Metal one, or any other big features. I think it’s better to have a working solution in your local repo before opening PRs. The local solution doesn’t have to be clean or anything, just a proof of concept. Then we can start breaking down the solution into smaller PRs, possibly doing some cleanups during this process. This way you can save a lot of the reviewer’s bandwidth :-) We also don’t run into the risk that, in case there is a major design flaw, the main codebase gets polluted by incomplete solutions.

@yuanming-hu
Member

yuanming-hu commented Feb 18, 2020

Yeah, I strongly agree on this two-stage strategy: writing a new backend inevitably has a lot of uncertainty, so it's great to have a minimal working prototype (e.g. one that runs mpm99.py, which is already pretty challenging IMO). Then clean up the code and break down the changes into small (and preferably testable, if I'm not asking for too much) PRs, each with clear meaning.

As @k-ye said, doing so allows you to get quick design feedback and thereby minimizes the risk of high-level design errors. It also minimizes the number of hours you spend before you get a working codegen & demos (and everybody gets super excited so you get more helping hands).

Last but not least, following this strategy makes the reviewer's job much easier :-)

@archibate
Collaborator Author

Thanks for your advice! I managed to use git rebase to reduce the reviewers' burden.

@archibate
Collaborator Author

@archibate archibate changed the title [WIP] Basic OpenGL backend [GLSL backend] stage 1: serial_kernel Feb 18, 2020
@archibate archibate changed the title [GLSL backend] stage 1: serial_kernel [OpenGL] stage 1: serial_kernel & simple arrays Feb 18, 2020
@archibate
Collaborator Author

archibate commented Feb 18, 2020

How should I deal with globals? How can I get raw data from SNodes? I can now pass data in and out of GL.

@archibate
Collaborator Author

You can probably reply here, and I'll see it tomorrow. Good night~~

@yuanming-hu
Member

We need to allocate something like a pixel buffer (or something related, my OpenGL knowledge is rusty), which we use as a byte array to store the global data structure :-)
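A minimal sketch of that idea (offsets and sizes here are hypothetical): treat the global data structure as one flat byte buffer. On the GL side those bytes would back an SSBO; below we model only the CPU-side view.

```python
# Hedged sketch: the global data structure as a flat byte array. On the GL
# side this buffer would be uploaded to an SSBO; here we model only the bytes.
import struct

ROOT_SIZE = 1024           # hypothetical root buffer size in bytes
root = bytearray(ROOT_SIZE)


def write_i32(offset, value):
    # little-endian i32, matching typical GPU buffer layouts
    struct.pack_into('<i', root, offset, value)


def read_i32(offset):
    return struct.unpack_from('<i', root, offset)[0]


# Storing x[None] = 233 for a 0-D i32 field placed at byte offset 0:
write_i32(0, 233)
print(read_i32(0))  # 233
```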

@yuanming-hu
Member

Btw, I found https://github.com/9ballsyndrome/WebGL_Compute_shader!

It will be great if we can do compute shader on WebGL, but my read is that it might take a couple more years for WebGL compute shader to become mature :-) For example, it's still not yet supported by default in my browser.

Inline review threads on:
taichi/backends/codegen_opengl.cpp
taichi/platform/opengl/opengl_api.cpp
@archibate
Collaborator Author

archibate commented Feb 19, 2020

I need a deeper understanding of SNodes now. When we say

x = ti.var(ti.i32, shape=())

@ti.kernel
def func():
    x[None] = 233

x isn't allocated until a kernel is called, right?
Then, is the backend responsible for allocation?
In fact, SSBOs aren't so precious; they don't need to be allocated the way cudaMallocManaged does. A plain malloc works - any region of CPU memory can be bound to an SSBO.

@yuanming-hu
Member

x isn't allocated until a kernel is called, right?

Yes, it will be allocated the first time a kernel is called, or when you access the data structure in Python scope (e.g. x[1, 2, 3] = 3 outside Taichi kernels).
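That lazy-allocation behavior can be sketched like this (class and attribute names are made up for illustration):

```python
# Hedged sketch: storage is materialized on first use, whether that is a
# kernel launch or a Python-scope access. Not Taichi's actual implementation.
class LazyField:
    def __init__(self, num_elements):
        self.num_elements = num_elements
        self.storage = None  # nothing allocated at declaration time

    def _materialize(self):
        if self.storage is None:
            self.storage = [0] * self.num_elements

    def __setitem__(self, i, value):
        self._materialize()  # a Python-scope write triggers allocation
        self.storage[i] = value

    def __getitem__(self, i):
        self._materialize()  # so does a read
        return self.storage[i]


x = LazyField(4)
assert x.storage is None  # declared, but not yet allocated
x[1] = 3                  # first access allocates
print(x[1])  # 3
```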

Then, is the backend responsible for allocation?

The struct compiler does allocation:

creator = [=]() {

The struct_metal file is also worth checking out.

@archibate
Collaborator Author

archibate commented Feb 19, 2020

or when you access the data structure in Python scope (e.g. x[1, 2, 3]=3 outside Taichi kernels)

I also noticed that x[1] = 2 or print(x[1]) is somehow translated into a kernel with an argument...

The struct_metal file is also worth checking out.

Oh, I almost forgot about that!

prog->memory_pool->set_queue((MemRequestQueue *)mem_req_queue);

It seems you have a memory pool there working for the LLVM backend? Maybe the GL backend can use it later too, as long as it doesn't use the CUDA API.

@yuanming-hu
Member

I also noticed that x[1] = 2 or print(x[1]) is translated into a kernel with an argument somehow...

Yeah, you'll have to launch a compute shader task for single-element accesses like these. A more efficient way is to use the numpy interfaces to/from_numpy() - I would suggest starting with the numpy interface first, since that's easier to implement.
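A minimal sketch of the difference (`root` here is a hypothetical stand-in for the field's SSBO-backed storage): per-element access would cost one compute-shader launch each, while the numpy interface moves the whole field in a single transfer.

```python
# Hedged sketch: bulk transfer via numpy vs. per-element access. `root`
# stands in for the SSBO storage; the helpers are illustrative, not Taichi's.
import numpy as np

root = np.zeros(16, dtype=np.int32)


def to_numpy():
    return root.copy()   # one bulk readback instead of 16 shader launches


def from_numpy(arr):
    root[:] = arr        # one bulk upload


from_numpy(np.arange(16, dtype=np.int32))
print(int(to_numpy()[3]))  # 3
```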

It seems you have a memory pool there working for the LLVM backend? Maybe the GL backend can use it later too, as long as it doesn't use the CUDA API.

The memory pool stuff relies on unified memory (which to my knowledge is not supported by OpenGL), so I would suggest starting with just dense snodes, whose storage can be directly pre-allocated.
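Why dense SNodes are easy to pre-allocate, in a small sketch (shape and element size are hypothetical): the extent is known at compile time, so a flat buffer and direct index-to-offset arithmetic suffice.

```python
# Hedged sketch: a dense SNode's storage size is known ahead of time, so the
# backend can pre-allocate a flat buffer and compute byte offsets directly.
shape = (4, 8)    # hypothetical dense field shape
DTYPE_SIZE = 4    # i32


def byte_offset(i, j):
    # row-major flattening of the 2-D index
    return (i * shape[1] + j) * DTYPE_SIZE


buffer_bytes = shape[0] * shape[1] * DTYPE_SIZE
print(buffer_bytes, byte_offset(2, 3))  # 128 76
```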

@archibate
Collaborator Author

to_numpy needs a struct_for kernel, which is not implemented yet.

@archibate
Collaborator Author

archibate commented Feb 19, 2020

Memory allocation is done. The next step is to figure out how x[0] (read_int) works - I want to see print show 233 on the screen.

[skip ci] test inout data[3] success

[skip ci] try to pass kernel arguments
@archibate
Collaborator Author

archibate commented Feb 19, 2020

We're making history! After this and stage 2 (struct_for kernel & parallel processing) are done, I think we're going to release our GPGPU programming language on Linux, OS X, and Windows, with an even wider user base!!! I've also thought of the stage 3 title: nested for & func~

@archibate
Collaborator Author

OK! I will fix the bug tomorrow. Check examples/opengl_example.py for a test, and good night.

@archibate
Collaborator Author

NOTE: 976d387 may have conflicts with #527.

@archibate
Collaborator Author

TODO: atan2 -> atan.

       genType atan(genType y, genType x);
       genType atan(genType y_over_x);
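A sketch of how the codegen could handle that TODO (the helper and table names are made up): GLSL has no atan2, but its two-argument atan(y, x) covers the same role, so the call name can simply be rewritten when emitting.

```python
# Hedged sketch: rename functions that differ between Taichi and GLSL at
# call-emission time. Hypothetical helper, not Taichi's actual codegen.
GLSL_FUNC_MAP = {'atan2': 'atan'}  # GLSL: genType atan(genType y, genType x)


def emit_call(op, args):
    glsl_name = GLSL_FUNC_MAP.get(op, op)  # fall through for sin, cos, ...
    return '{}({})'.format(glsl_name, ', '.join(args))


print(emit_call('atan2', ['y', 'x']))  # atan(y, x)
print(emit_call('sin', ['x']))         # sin(x)
```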

@archibate archibate requested a review from k-ye February 25, 2020 11:09
@archibate
Collaborator Author

archibate commented Feb 25, 2020

I simply skipped OpenGL in test_types for now, since it doesn't support many of the types involved there, such as u8. The types GLSL supports natively are: i1 (bool), i32, u32, f32, f64.
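That restriction could be enforced with a small type-mapping table (the mapping below is an illustration, not Taichi's actual table):

```python
# Hedged sketch: map Taichi scalar types to GLSL and reject ones GLSL has no
# native storage for (e.g. u8). Illustrative only.
GLSL_TYPES = {'i32': 'int', 'u32': 'uint', 'f32': 'float', 'f64': 'double'}


def glsl_type(ti_type):
    if ti_type not in GLSL_TYPES:
        raise NotImplementedError('GLSL cannot store ' + ti_type)
    return GLSL_TYPES[ti_type]


print(glsl_type('i32'))  # int
```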

@k-ye
Member

k-ye commented Feb 25, 2020

I saw that you requested a review. If you think this is ready, as suggested before, could you break it down into smaller PRs? Otherwise it's very hard to review... You may need to create new branches and open new PRs, while archibate:opengl remains an experimental branch containing a working solution for reference ;)

@archibate
Collaborator Author

archibate commented Feb 25, 2020

Yes, I will, once everything is done. But the actual reason I requested a review is that I'm afraid my changes in test_types.py could break yours.

@archibate
Collaborator Author

ALL TESTS PASSED!!!

@k-ye
Member

k-ye commented Feb 25, 2020

I'm afraid my changes in test_types.py could break yours.

Ha, thanks. Don't worry too much until it actually gets merged... And I think it's fine not to worry too much about breaking the Metal backend. After all, it's hard to test code you don't even have a way to run.

@archibate
Collaborator Author

Broken down into smaller PRs; closing this one.

3 participants