Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GUI] [linux] Support "ti.GUI(fast_gui=True)" for zero-copy while gui.set_image(img) #1922

Merged
merged 6 commits into from
Oct 12, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
42 changes: 42 additions & 0 deletions docs/gui.rst
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Create a window
:parameter background_color: (optional, RGB hex) background color of the window
:parameter show_gui: (optional, bool) see the note below
:parameter fullscreen: (optional, bool) ``True`` for fullscreen window
:parameter fast_gui: (optional, bool) see :ref:`fast_gui`
:return: (GUI) an object represents the window

Create a window.
Expand Down Expand Up @@ -558,3 +559,44 @@ Image I/O
If ``h`` is not specified, it will be equal to ``w`` by default.

The output image shape is: ``(w, h, *img.shape[2:])``.


.. _fast_gui:

Zero-copying frame buffer
-------------------------

Sometimes when the GUI resolution (window size) is large, we find it impossible
to reach 60 FPS even without any kernel invocations between each frame.

This is mainly due to the copy overhead when Taichi GUI is copying image buffer
from a place to another place, to make high-level painting APIs like
``gui.circles`` functional. The larger the image, the larger the overhead.

However, in some cases we only need ``gui.set_image`` alone. Then we may turn
on the ``fast_gui`` mode for better performance.

It will directly write the image specified in ``gui.set_image`` to frame buffer
without hesitation, results in a much better FPS when resolution is huge.

To do so, simply initialize your GUI with ``fast_gui=True``:

.. code-block:: python

gui = ti.GUI(res, title, fast_gui=True)


.. note::

If possible, consider enabling this option, especially when ``fullscreen=True``.


.. warning::

Despite the performance boost, it has many limitations as trade off:

``gui.set_image`` is the only available paint API in this mode. **All other
APIs like ``gui.circles``, ``gui.rect``, ``gui.triangles``, etc., won't work**.

``gui.set_image`` will only takes Taichi 3D or 4D vector fields (RGB or RGBA)
as input.
21 changes: 21 additions & 0 deletions examples/fullscreen.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
import taichi as ti
import numpy as np

ti.init(ti.gpu)

res = (1920, 1080)
img = ti.Vector.field(3, float, res)


@ti.kernel
def render(t: float):
for i, j in img:
a = ti.Vector([i / res[0], j / res[1] + 2, i / res[0] + 4])
img[i, j] = ti.cos(a + t) * 0.5 + 0.5


gui = ti.GUI('UV', res, fullscreen=True, fast_gui=True)
while not gui.get_event(ti.GUI.ESCAPE):
render(gui.frame * 0.04)
gui.set_image(img)
gui.show()
22 changes: 0 additions & 22 deletions misc/demo_fullscreen.py

This file was deleted.

10 changes: 10 additions & 0 deletions python/taichi/lang/meta.py
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,16 @@ def cook_image_type(x):
return x


@ti.kernel
def vector_to_fast_image(img: ti.template(), out: ti.ext_arr()):
# FIXME: Why is ``for i, j in img:`` slower than:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you know how much it was slower?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you reproduce it? I'm building right now. Will tell you asap. This is likely an issue of row-major / col-major, wdyt?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No worries. I was just curious if the slowness was just some noise or not. It does sound like row-major vs column-major could be a factor. Thanks for confirming!

Did you reproduce it?

I tried running the example, but all i got was a black screen.. (Tried both RGB and BGR). Given that the original image was of type ti.f32, maybe that could be the reason? (Or do i actually need to pass fast_buf somewhere for Cocoa?)

for i, j in ti.ndrange(*img.shape):
u, v, w = min(255, max(0, int(img[i, img.shape[1] - 1 - j] * 255)))
# We use i32 for |out| since OpenGL and Metal doesn't support u8 types
# TODO: treat Cocoa and Big-endian machines, with XOR logic
out[j * img.shape[0] + i] = w + (v << 8) + (u << 16)


@ti.kernel
def tensor_to_image(tensor: ti.template(), arr: ti.ext_arr()):
for I in ti.grouped(tensor):
Expand Down
32 changes: 28 additions & 4 deletions python/taichi/misc/gui.py
Original file line number Diff line number Diff line change
Expand Up @@ -40,19 +40,31 @@ def __init__(self,
res=512,
background_color=0x0,
show_gui=True,
fullscreen=False):
fullscreen=False,
fast_gui=False):
if 'TI_GUI_SHOW' in os.environ:
show_gui = bool(int(os.environ['TI_GUI_SHOW']))
if 'TI_GUI_FULLSCREEN' in os.environ:
fullscreen = bool(int(os.environ['TI_GUI_FULLSCREEN']))
if 'TI_GUI_FAST' in os.environ:
fast_gui = bool(int(os.environ['TI_GUI_FAST']))

self.name = name
if isinstance(res, numbers.Number):
res = (res, res)
self.res = res
# The GUI canvas uses RGBA for storage, therefore we need NxMx4 for an image.
self.img = np.ascontiguousarray(np.zeros(self.res + (4, ), np.float32))
self.core = ti_core.GUI(name, core_veci(*res), show_gui, fullscreen)
self.fast_gui = fast_gui
if fast_gui:
self.img = np.ascontiguousarray(
np.zeros(self.res[0] * self.res[1], dtype=np.uint32))
fast_buf = self.img.ctypes.data
else:
# The GUI canvas uses RGBA for storage, therefore we need NxMx4 for an image.
self.img = np.ascontiguousarray(
np.zeros(self.res + (4, ), np.float32))
fast_buf = 0
self.core = ti_core.GUI(name, core_veci(*res), show_gui, fullscreen,
fast_gui, fast_buf)
self.canvas = self.core.get_canvas()
self.background_color = background_color
self.key_pressed = set()
Expand Down Expand Up @@ -143,6 +155,18 @@ def set_image(self, img):
import numpy as np
import taichi as ti

if self.fast_gui:
assert isinstance(img, ti.Matrix), \
"Only ti.Vector.field is supported in GUI.set_image when fast_gui=True"
assert img.shape == self.res, \
"Image resolution does not match GUI resolution"
assert img.n in [3, 4] and img.m == 1, \
"Only RGB images are supported in GUI.set_image when fast_gui=True"

from taichi.lang.meta import vector_to_fast_image
vector_to_fast_image(img, self.img)
return

if isinstance(img, ti.Expr):
if ti.core.is_integral(img.dtype) or len(img.shape) != 2:
# Images of uint is not optimized by xxx_to_image
Expand Down
12 changes: 11 additions & 1 deletion taichi/gui/gui.h
Original file line number Diff line number Diff line change
Expand Up @@ -506,6 +506,8 @@ class GUI : public GUIBase {
std::vector<std::unique_ptr<float>> widget_values;
bool show_gui;
bool fullscreen;
bool fast_gui;
uintptr_t fast_buf;

void set_mouse_pos(int x, int y) {
cursor_pos = Vector2i(x, y);
Expand Down Expand Up @@ -763,13 +765,17 @@ class GUI : public GUIBase {
int height = 800,
bool show_gui = true,
bool fullscreen = true,
bool fast_gui = false,
uintptr_t fast_buf = 0,
bool normalized_coord = true)
: window_name(window_name),
width(width),
height(height),
key_pressed(false),
show_gui(show_gui),
fullscreen(fullscreen) {
fullscreen(fullscreen),
fast_gui(fast_gui),
fast_buf(fast_buf) {
memset(button_status, 0, sizeof(button_status));
start_time = taichi::Time::get_time();
buffer.initialize(Vector2i(width, height));
Expand All @@ -788,12 +794,16 @@ class GUI : public GUIBase {
Vector2i res,
bool show_gui,
bool fullscreen = true,
bool fast_gui = false,
uintptr_t fast_buf = 0,
bool normalized_coord = true)
: GUI(window_name,
res[0],
res[1],
show_gui,
fullscreen,
fast_gui,
fast_buf,
normalized_coord) {
}

Expand Down
22 changes: 20 additions & 2 deletions taichi/gui/x11.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,9 @@ class CXImage {
public:
XImage *image;
std::vector<uint8> image_data;
void *fast_data{nullptr};
int width, height;

CXImage(Display *display, Visual *visual, int width, int height)
: width(width), height(height) {
image_data.resize(width * height * 4);
Expand All @@ -29,6 +31,17 @@ class CXImage {
TI_ASSERT((void *)image->data == image_data.data());
}

CXImage(Display *display,
Visual *visual,
void *fast_data,
int width,
int height)
: width(width), height(height) {
image = XCreateImage(display, visual, 24, ZPixmap, 0, (char *)fast_data,
width, height, 32, 0);
TI_ASSERT((void *)image->data == fast_data);
}

void set_data(const Array2D<Vector4> &color) {
auto p = image_data.data();
for (int j = 0; j < height; j++) {
Expand Down Expand Up @@ -164,11 +177,16 @@ void GUI::create_window() {
XSetWMProtocols((Display *)display, window, (Atom *)wmDeleteMessage.data(),
1);
XMapWindow((Display *)display, window);
img = new CXImage((Display *)display, (Visual *)visual, width, height);
if (!fast_gui)
img = new CXImage((Display *)display, (Visual *)visual, width, height);
else
img = new CXImage((Display *)display, (Visual *)visual, (void *)fast_buf,
width, height);
}

void GUI::redraw() {
img->set_data(buffer);
if (!fast_gui)
img->set_data(buffer);
XPutImage((Display *)display, window, DefaultGC(display, 0), img->image, 0, 0,
0, 0, width, height);
}
Expand Down
2 changes: 1 addition & 1 deletion taichi/python/export_visual.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ void export_visual(py::module &m) {
.value("Press", Type::press)
.value("Release", Type::release);
py::class_<GUI>(m, "GUI")
.def(py::init<std::string, Vector2i, bool, bool>())
.def(py::init<std::string, Vector2i, bool, bool, bool, uintptr_t>())
.def_readwrite("frame_delta_limit", &GUI::frame_delta_limit)
.def_readwrite("should_close", &GUI::should_close)
.def("get_canvas", &GUI::get_canvas, py::return_value_policy::reference)
Expand Down