Make high resolution scrolling more responsive and configurable #1129

maximbaz · 2018-11-09T12:28:46Z

I was testing high resolution scrolling under Wayland and found that scrolling becomes acceptable if I multiply yoffset by 10 and awesome if I multiply it by 20.

Since wheel_scroll_multiplier is not used under high resolution scrolling at all, I thought we could put it to good use here, so that responsiveness becomes configurable.

Under Wayland, my viewport_y_ratio is 2. I added it to this patch because it was suggested by @Luflosi in #1112; if you want to keep it, 82f9aec probably needs to be reverted.

With this formula, scrolling is acceptable with default settings and I can further improve it if I change wheel_scroll_multiplier to 10.

If we remove viewport_y_ratio from the formula, the default value of wheel_scroll_multiplier (5) is too slow, and I would need to change it to at least 10, but better to 20.
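
As a rough illustration of the numbers in this comment, here is a minimal Python sketch. It assumes the patch scales the raw offset as `yoffset * viewport_y_ratio * wheel_scroll_multiplier`; the exact expression in the patch is not quoted in this thread, and the function name `scaled_offset` is hypothetical.

```python
def scaled_offset(yoffset, wheel_scroll_multiplier=5.0, viewport_y_ratio=1.0):
    """Scale a raw high resolution scroll offset (assumed formula, not the actual patch)."""
    return yoffset * viewport_y_ratio * wheel_scroll_multiplier


# With viewport_y_ratio = 2, the default multiplier of 5 gives an effective
# factor of 10, and a multiplier of 10 gives an effective factor of 20.
print(scaled_offset(1.0, 5.0, 2.0))   # 10.0
print(scaled_offset(1.0, 10.0, 2.0))  # 20.0

# Without viewport_y_ratio, the multiplier alone has to reach 10 or 20.
print(scaled_offset(1.0, 20.0))       # 20.0
```

Under that assumption, the "acceptable" factor of 10 and the "awesome" factor of 20 correspond to multipliers of 5 and 10 on a display where viewport_y_ratio is 2.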

Luflosi · 2018-11-09T13:23:15Z

The default for wheel_scroll_multiplier is still 5, right? This will make kitty scroll five times as fast after applying this patch. Is it possible to change the default to 1 when high resolution scrolling is supported?

maximbaz · 2018-11-09T13:25:56Z

Yes, by setting wheel_scroll_multiplier to 1 in kitty.conf 🙂 But compared to all other apps, kitty scrolls much slower, and with this patch it scrolls approximately as fast as the others (and by setting wheel_scroll_multiplier to 10 I get the speed of sakura).

UPDATE: I misunderstood what you were suggesting, sorry; you are proposing to change the default value to 1. I still think 5 is a good default value, based on my personal experience comparing scrolling speed with other apps as described above.

Luflosi · 2018-11-09T13:43:19Z

At least on macOS a value of 1 makes kitty scroll as fast as all other apps.

maximbaz · 2018-11-09T13:45:45Z

Just to confirm, are you testing with 82f9aec reverted? Because both that commit and this PR increase the speed on macOS.

In any case, I don't really care about the default value, I'm totally fine changing it in my local config as long as it's configurable 😉

Luflosi · 2018-11-09T13:52:56Z

That commit doesn't make a difference for me since yscale is 1 for me since I don't have a retina display. And I didn't suggest changing the default value for everyone, only where high precision scrolling is supported. I think that 5 lines per scroll event are pretty reasonable when scrolling with a mouse that doesn't support high precision scrolling.

maximbaz · 2018-11-09T13:55:51Z

Maybe it wasn't a good idea then to reuse the wheel_scroll_multiplier option, and we had better have a different one for high precision scrolling...

Luflosi · 2018-11-09T14:58:32Z

I think the semantics of that option make perfect sense for low and high precision scrolling and we should not create a new option.

maximbaz · 2018-11-09T16:48:06Z

If the option has different default values depending on whether the device is "high" or "low" precision, users would need a way to override both of these values, and I think that would be impossible...

In other words, I would like to override wheel_scroll_multiplier to 10 to get the speed as in sakura with touchpad, but I don't want to change the default 5 for when I use a mouse.

kovidgoyal · 2018-11-10T05:26:43Z

That is not the right place for this patch. It belongs in glfw/wl_window.c, just like the fix for macOS that I committed recently. Basically, the backend should be responsible for providing the correct number of scrolled pixels as reported by the OS. kitty should not be messing with that value.

As for adding a configurable multiplier, I agree it should be a separate option, after all the current one is named "wheel_scroll_multiplier" so the new one could be named "touch_scroll_multiplier" or similar, and it should default to 1.
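
The split proposed here could look roughly like the following Python sketch. `touch_scroll_multiplier` is the name suggested above; the `ScrollOptions` class, the `is_high_resolution` flag and the function name are hypothetical and only illustrate the idea, they are not kitty's actual code.

```python
from dataclasses import dataclass


@dataclass
class ScrollOptions:
    wheel_scroll_multiplier: float = 5.0  # existing option, default 5
    touch_scroll_multiplier: float = 1.0  # proposed option, default 1


def apply_multiplier(offset, is_high_resolution, opts):
    """Scale a scroll offset with the multiplier that matches the device type."""
    if is_high_resolution:
        # Offset is in pixels, as reported by the backend.
        return offset * opts.touch_scroll_multiplier
    # Offset is in discrete wheel events.
    return offset * opts.wheel_scroll_multiplier


# Overriding only the touch multiplier leaves mouse wheel behaviour untouched.
opts = ScrollOptions(touch_scroll_multiplier=10.0)
print(apply_multiplier(3.0, True, opts))   # 30.0
print(apply_multiplier(1.0, False, opts))  # 5.0
```

With two independent options, a user can speed up touchpad scrolling without changing the default of 5 for a regular mouse wheel, which is the use case raised earlier in the thread.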

kovidgoyal · 2018-11-10T05:30:21Z

It would be interesting to see what VTE does on wayland. Why is it scrolling faster than kitty? Does it multiply the pixels by something and if so where does the value of that something come from? @egmontkob you know the answer?

egmontkob · 2018-11-10T09:08:06Z

GTK+ gives us a smooth scrolling delta which hides from us the X11 vs. Wayland differences, and is probably already scaled by some global GTK+ or GNOME setting (I'm not sure about it). I'm not aware of these details.

Then in VTE it's scaled according to ceil(height / 10.0) here, which may not be a good approach as per 748012. See also 769696.
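
For illustration, the scaling rule mentioned above can be written as the following Python sketch. It assumes `height` is the terminal height in rows, which the comment does not state; the function names are hypothetical and this is not VTE's actual code.

```python
import math


def vte_like_factor(height):
    """Scale factor ceil(height / 10.0); 'height' is assumed to be in rows."""
    return math.ceil(height / 10.0)


def vte_like_scaled_delta(delta, height):
    """Multiply a smooth scrolling delta by the height-dependent factor."""
    return delta * vte_like_factor(height)


# The factor grows with the window height, so taller windows scroll faster.
for rows in (10, 24, 50):
    print(rows, vte_like_factor(rows))
```

Because the factor depends on the window height, the same gesture scrolls by a different amount in windows of different sizes, which may be part of why the comment calls the approach questionable.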

kovidgoyal · 2018-11-10T09:33:51Z

@egmontkob thanks for the pointers

maximbaz · 2018-11-10T11:21:48Z

Alright, I don't think I should modify glfw/wl_window.c at this point: I don't really know whether it reports the wrong number of pixels on HiDPI screens, I don't have other Wayland apps to compare against (sakura is 20x faster, and Chromium and Firefox run in X-compatibility mode on Wayland, so I can't use them as a reference), and I don't want to make the backend arbitrarily report numbers twice as large.

Let's start with implementing touch_scroll_multiplier, I'll get on it.

ducis · 2020-03-11T01:25:36Z

Hello, I am wondering: in the current master branch, when the last line is updated (or a new line is added at the bottom), does kitty redraw the whole window? For example, if there is an "A" in the second line from the bottom and the first line from the bottom changes, does kitty go through the process of rasterizing the letter "A" again? I assume that if it does not need to (because it holds a buffer of previously rasterized parts of the terminal window), high-prec scrolling would be easy and less computation would be needed?

ducis · 2020-03-11T01:31:36Z

So far as I know, a terminal program either is only able to change some lines from the bottom (normal terminal program, and tab completion) or takes control of the whole (virtual) terminal (like vim, tmux, or clear), it does not change lines from the top or in the middle. In the latter case you don't have scrolling anyway. So we can always assume a terminal program changes a certain number of lines from the bottom. Is it correct?

kovidgoyal · 2020-03-11T01:41:23Z

kitty only ever rasterizes a character once, that's part of the magic
behind its performance. And no, any terminal program
can make changes to any part of the screen at any time, for example by
simply changing the background color, or directly by moving the cursor
around the screen.

ducis · 2020-03-11T02:26:52Z

Thanks. Then where shall I start looking into the scrolling/rendering stuff? In particular, which variables in the source refer to the previously rasterized content?

kovidgoyal · 2020-03-11T02:35:45Z

there is no previously rasterized content. rendering happens on the GPU.
If you want to get previously rendered content you have to render into a
framebuffer. There is code that does that for other reasons in shaders.c
but you are going to need familiarity with OpenGL to make it work

ducis · 2020-03-11T03:07:04Z

So by "rasterizes a character once" you meant there are caches like many images with the size of individual characters in the graphics memory?

I guess by "does that for other reasons in shaders.c" you mean draw_cells_interleaved_premult().
Is this function triggered in, say, a fresh install of kitty without customized configuration file or passed commandline options? If not, how should I enable it so that I can test if it affects the latency first?

ducis · 2020-03-11T03:20:02Z

Does the framebuffer to render into live in the graphics memory or the main memory?
Usually for this kind of thing it should live in the graphics memory, and the application has pointer-like handles with which it can tell the GPU to do things like memcpy() purely in the graphics memory.
But I am not sure if you meant we have to fetch the rendered results into the main memory by "you have to render into a framebuffer", which would be much worse performance-wise.

kovidgoyal · 2020-03-11T03:25:40Z

On Tue, Mar 10, 2020 at 08:07:16PM -0700, ducis wrote:
> So by "rasterizes a character once" you meant there are caches like many images with the size of individual characters in the graphics memory?

There is a sprite map stored on the GPU.

> I guess by "does that for other reasons in shaders.c" you mean draw_cells_interleaved_premult(). Is this function triggered in, say, a fresh install of kitty without customized configuration file or passed commandline options? If not, how should I enable it so that I can test if it affects the latency first?

It will be used when you have a transparent window and images displayed in the window.

kovidgoyal · 2020-03-11T03:26:06Z

On Tue, Mar 10, 2020 at 08:20:13PM -0700, ducis wrote:
> Does the framebuffer to render into live in the graphics memory or the main memory? Usually for this kind of things it should live in the graphics memory and the application has pointer-like stuff with which the app can tell the GPU to do things like memcpy() purely in the graphics memory. But I am not sure if you meant we have to fetch the rendered results into the main memory by "you have to render into a framebuffer", which would be much worse performance-wise.

framebuffers live in GPU memory

ducis · 2020-03-11T04:09:56Z

Hi, I just tested with typometer.
The environment is compiz (0.8.*, not the Ubuntu one) 60Hz frame rate globally,
both delays turned to 0, VSync enabled in compiz but disabled in kitty (so no tearing).
Transparency within kitty roughly doubles the latency from ~17ms to ~31ms,
with or without icatted images.
But transparency with compiz plugins (which can make any window translucent) results in latency ~26ms.
Do you think that the 17ms->31ms increase is caused by going through draw_cells_interleaved_premult()?
Am I missing anything?
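
To put these measurements in terms of display frames, here is a small Python sketch. The latencies are the approximate values reported above and the 60Hz refresh rate is the one stated for compiz; the function name is hypothetical.

```python
def latency_in_frames(latency_ms, refresh_hz=60.0):
    """Express a measured latency as a number of display frames."""
    return latency_ms * refresh_hz / 1000.0


measurements = (
    ("no transparency", 17),
    ("transparency within kitty", 31),
    ("transparency via compiz plugin", 26),
)

# One frame at 60Hz lasts about 16.7ms.
for label, ms in measurements:
    print(f"{label}: {latency_in_frames(ms):.2f} frames")
```

At 60Hz the jump from ~17ms to ~31ms is a little under one frame interval, which is why the question of whether an extra frame is being forced comes up.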

kovidgoyal · 2020-03-11T04:18:38Z

Put a printf() in draw_cells_interleaved_premult() to check if it is going through that. IIRC without images it will not be used, see line 636 onwards of shaders.c for details. Either you need transparency + any image or non-transparent and negative z-index images.

ducis · 2020-03-11T05:04:25Z

I put in the printf() and tested again. Yes, draw..premult() is only triggered by both background_opacity and icat.
How transparency affects latency really depended on what windows were below it.
It seems that the more complex the content below a translucent window, the more likely the computation will take more than one frame.
But in general, transparency, as well as draw_cells_interleaved_premult() when triggered by icat, does not force an additional frame.

kovidgoyal · 2020-03-11T05:18:15Z

You should be able to trigger it with just

kitty -o background_image=whatever.png

and no transparency (assuming you are running from master)

ducis · 2020-03-11T05:37:03Z

No, I am running from 0.16.0 which does not seem to have background_image,
but I just forced draw_cells_interleaved_premult() at the end of draw_cells().
Somehow it made the font bolder and much uglier when background_opacity=1,
so there was something wrong this way.
But regardless, the speed was just as fast as unmodded whether background_opacity = 1 or not,
so long as there was nothing below the window to be actually blended.

Maybe there should be an option to explicitly force rendering to a framebuffer
regardless of transparency or whatever?
So everyone can benchmark. After some time we may find out that
we can use an intermediate framebuffer with negligible penalty in performance,
which would then enable more fancy stuff.

I am even using Intel graphics; an additional copy would be even more forgiving on a real GPU.

kovidgoyal · 2020-03-11T05:47:10Z

Additional copies may have negligible performance penalty, in terms of
render time, but that's not my philosophy. My goal is to reduce energy
consumption, not just render time. That is why kitty has various
settings that actually insert latency into the render loop. And
performing unnecessary operations on all renders just for smooth
scrolling is not a worthwhile tradeoff.

I don't really see why that would be needed anyway. You could just switch
rendering to using a framebuffer while scrolling and turn it off again
after. It would mean writing a bit more code which you would have to do
anyway, since to actually implement scrolling you need to render stuff
above and below the window limits.
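
The point about rendering beyond the window limits can be illustrated with a minimal Python sketch. It is not kitty code: the function name, the integer pixel offset and the `cell_height` parameter are all assumptions made for the example.

```python
def rows_to_render(pixel_offset, cell_height, num_rows):
    """Return (first_row, row_count, y_shift) for a view scrolled by whole pixels.

    pixel_offset: how far the view is scrolled, in pixels (non-negative int).
    cell_height:  height of one text row, in pixels.
    num_rows:     number of rows that fit in the window.
    """
    first_row, remainder = divmod(pixel_offset, cell_height)
    # A non-zero remainder means the top and bottom rows are both only partly
    # visible, so one row more than fits in the window has to be drawn.
    row_count = num_rows + (1 if remainder else 0)
    # The rendered rows are shifted up by the remainder.
    return first_row, row_count, -remainder


print(rows_to_render(25, 20, 24))  # (1, 25, -5)
print(rows_to_render(40, 20, 24))  # (2, 24, 0)
```

Whenever the offset is not a multiple of the cell height, one more row than the window holds must be rendered, which is the extra code path a pixel-precise scrolling implementation would need regardless of whether a framebuffer is used.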

ducis · 2020-03-11T06:09:15Z

It is hard to say without actual implementation and benchmarking, but keeping a large buffer of results painted in the last frame might actually reduce computation at the expense of more VRAM.

But whatever, what is the current situation of smooth scrolling in the master branch?
I saw that you merged something from this issue and the Luflosi branch in #1454,
but I couldn't figure out whether we can enable (some sort of) smooth scrolling in the current master.

kovidgoyal · 2020-03-11T06:18:59Z

there is no smooth scrolling #1454

Luflosi · 2020-03-11T11:49:50Z

@ducis We can work on this together, if you like.

ducis · 2020-03-11T14:45:58Z

@Luflosi Once I switch completely from tmux.

Make high resolution scrolling more responsive and configurable

3f77a20

maximbaz mentioned this pull request Nov 9, 2018

Scroll by pixels, not by lines, and support acceleration #1123

Closed

Implement touch_scroll_multiplier

5e27c21

kovidgoyal merged commit 5e27c21 into kovidgoyal:master Nov 11, 2018
