Is it possible to improve the performance of line()? #456

ceball · 2017-09-08T22:49:26Z

I started with examples/timeseries.ipynb, modifying to have only one column:

In a bit of an ugly process, I then time canvas.line() and canvas.points() for increasing repeats of the dataframe:

line seems to be a lot slower than points:

(Sorry for the screenshots, but github doesn't seem to allow notebooks to be attached.)

The text was updated successfully, but these errors were encountered:

ceball · 2017-09-08T23:09:37Z

I just looked at the source code and see line uses agg=any() by default, while points uses agg=count() by default. So just a note to confirm that the findings above don't change much if I use the same reduction for both line and points.

jbednar · 2017-09-08T23:46:56Z

Joseph or Greg had some ideas about how to speed it up, but fundamentally lines should always be slower than points because points just increments one bin, while line needs to solve the equation for a line and increment every point along the way.

ceball · 2017-09-08T23:52:28Z

Yes, I guess the title should have been more like, 'Is line the expected amount slower than points?', or maybe, 'Can line be made faster?'.

jbcrail · 2017-09-11T20:49:35Z

My immediate suggestions for performance gains are:

Switch to integer-specific Bresenham line-drawing algorithm

We currently use the more general algorithm that supports both ints and floats. However, by the time we need to draw a line, the points have been mapped to an integer space so float support is redundant.

Switch line-clipping algorithm

We currently use Cohen-Sutherland for clipping a line to a bounding box, but the Liang-Barsky algorithm is considered significantly more efficient.

jbednar · 2017-09-12T00:13:46Z

I'd be very happy to try both suggestions; we're using line a lot in some performance-critical applications, so anything will help...

I switched from Cohen-Sutherland to Liang-Barsky. The performance gains for random lines range from 50-75% improvement for a million lines. Related to #456

Switched from Cohen-Sutherland to Liang-Barsky. The performance gains for random lines range from 50-75% improvement for a million lines. Related to #456.

jbednar · 2017-10-18T20:50:11Z

It would be nice to re-run those benchmarks now that #495 has been merged.

Switched from Cohen-Sutherland to Liang-Barsky. The performance gains for random lines range from 50-75% improvement for a million lines. Related to #456.

ceball changed the title ~~Why is line slower than points?~~ Is it possible to improve the performance of line()? Sep 9, 2017

jbcrail added a commit that referenced this issue Oct 16, 2017

Switch line-clipping algorithm

e478d8f

I switched from Cohen-Sutherland to Liang-Barsky. The performance gains for random lines range from 50-75% improvement for a million lines. Related to #456

jbcrail mentioned this issue Oct 16, 2017

Switch line-clipping algorithm #495

Merged

jbednar pushed a commit that referenced this issue Oct 16, 2017

Switch line-clipping algorithm (#495)

edf1096

Switched from Cohen-Sutherland to Liang-Barsky. The performance gains for random lines range from 50-75% improvement for a million lines. Related to #456.

jbednar pushed a commit that referenced this issue Oct 30, 2017

Switch line-clipping algorithm (#495)

82d4bcc

Switched from Cohen-Sutherland to Liang-Barsky. The performance gains for random lines range from 50-75% improvement for a million lines. Related to #456.

jbednar mentioned this issue Dec 19, 2018

Datashader internals to-do list #672

Open

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it possible to improve the performance of line()? #456

Is it possible to improve the performance of line()? #456

ceball commented Sep 8, 2017

ceball commented Sep 8, 2017

jbednar commented Sep 8, 2017 via email •

edited

Loading

ceball commented Sep 8, 2017

jbcrail commented Sep 11, 2017 •

edited

Loading

jbednar commented Sep 12, 2017

jbednar commented Oct 18, 2017

Is it possible to improve the performance of line()? #456

Is it possible to improve the performance of line()? #456

Comments

ceball commented Sep 8, 2017

ceball commented Sep 8, 2017

jbednar commented Sep 8, 2017 via email • edited Loading

ceball commented Sep 8, 2017

jbcrail commented Sep 11, 2017 • edited Loading

jbednar commented Sep 12, 2017

jbednar commented Oct 18, 2017

jbednar commented Sep 8, 2017 via email •

edited

Loading

jbcrail commented Sep 11, 2017 •

edited

Loading