Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Defer processing of KML to allow for smoother performance #3970

Closed
wants to merge 8 commits into from
Closed

Defer processing of KML to allow for smoother performance #3970

wants to merge 8 commits into from

Conversation

mmacaula
Copy link
Contributor

We were running into an issue with large KML files, a ~33mb KML file would cause the whole browser tab to crash. You may say, that's just too big, but it zips down to a 2.1Mb Kmz file (email me for a sample file) and I'm sure there are lots of those around. I tracked it down to all kml processing being done in single large chain of methods that didn't release the main thread.

I took advantage of the fact that load is currently promise based already and 'promisified' the process* functions. The result is that Cesium will no longer just hang when processing large kml files.

Since when.js isn't quite a+ compliant yet, and slightly slower (see #3967 ) this sacrifices some total time to process in order to not cause the browser to freeze. Once promises are upgrade, the deferAndWrapInPromise code can likely be removed in place of a when.lift or other equivalent. To me, this is worthwhile for user experience but would be happy to discuss further if it's an issue.

I didn't create an issue for this but can if that fits your workflow better.

@pjcozzi
Copy link
Contributor

pjcozzi commented May 27, 2016

Thanks @mmacaula! Sounds like a nice improvement.

Can you update CHANGES.md and is there a reasonable way to unit test this?

@tfili can you review and merge when ready?

@mmacaula
Copy link
Contributor Author

The nice thing is that this doesn't change anything functionally, so the current unit tests were an excellent 'security blanket' that I didn't break anything. I also did some of my own testing. If @tfili has any suggestions for testing, I'd be happy to write some more.

@mramato
Copy link
Contributor

mramato commented May 31, 2016

I need to look a little closer at the code, but at first glance this looks great. I want to try it with some large KML to see how much it affects performance.

Thanks @mmacaula!

@@ -1947,20 +1945,52 @@ define([
Tour : processUnsupported
};

var promisifiedFeatureTypes = {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's no need for both promisifiedFeatureTypes and featureTypes, I would just use featureTypes and update the functions.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sure

var deferred = when.defer();
var args = Array.prototype.slice.call(arguments);
var context = this;
setTimeout(function deferredFunc(){
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Using setTimeout here is not an ideal solution because it happens outside the requestAnimationFrame event loop (and therefore has negative impact on overall performance). In general, setTimeout should be avoided in any Cesium application for this reason.

Instead, we have DataSource.update, which is an optional function that will be called every frame during the event loop. I think what we would want is incremental loading to happen by doing a little bit of KML work in every call to update (either through time-boxing or other form of work counting). This approach may not lend itself well to the Promise-based approach (I'd have to think about it some more), but would be a much better solution (and is something we should probably due in all of our Datasources. If you add KmlDataSource.prototype.update it should automatically start getting called every frame.

Perhaps we just change the processXXX functions to always return a "next" function that we can call to continue processing. In update, we would call the next function as many times as possible in some pre-determined allotted time. I don't think it should be hard to do. What do you think?

@shunter any thoughts here?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm, I only used setTimeout because the current version of when.js in cesium didn't support setting up stuff asynchronously (at least I couldn't tell in my research). A more updated promise lib that was a+ compatible might do things in a more performant way without having to burden the render loop with non-rendering stuff. I could always try stuffing bluebird in there and profiling and comparing to when.js to see if it makes a difference. Anything to look for as a tell for setTimeout being slow?

That said, timeboxing update calls might be a big win anyway. With very large kml files, we see very long calls to update as cesium processes the initial entities produced by the giant kml we use.

I'll wait for @shunter before doing anything now.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please don't tie loading of data too closely to the Cesium render loop. It will break applications that shut down Cesium's render loop to avoid burning 100% CPU all the time. I don't mind if I need to do extra work in my app to make loading progress while rendering is stopped, but please think about how it's possible at least.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mramato I've been churning it over and I think our best bet is still to use setTimeout, at least for now. Tying it too closely with the render loop would have unintended side effects (as @kring pointed out). It will impact performance, but at the browsers discretion. In my testing, the browser still runs the render loop (and other browser stuff like mouse clicks etc) while this processing is going on.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've been doing more research and really diving into event loops and microtasks. There definitely are some issues with setTimeout and I'm sure you've seen that it does take longer to process now. Luckily there are some clever ways out there that let you schedule a task on the next tick of the event loop without using setTimeout.

Resources:
https://www.nczonline.net/blog/2013/07/09/the-case-for-setimmediate/
And a polyfill that we could use to replace the setTimeout call:
https://github.com/YuzuJS/setImmediate

The demo / test site is very compelling:
http://jphpsf.github.io/setImmediate-shim-demo/

As another aside, opening devtools on this github page, and typing 'setImmediate` I see that github is using what appears to be this exact polyfill (though it's minified).

Thoughts?

@pjcozzi
Copy link
Contributor

pjcozzi commented Jan 2, 2017

This pull request hasn't been updated in six months.

@mmacaula @kring @mramato is this approach viable or should this be closed?

@pjcozzi
Copy link
Contributor

pjcozzi commented Jan 15, 2017

Closing since there was no response to #3970 (comment).

@mmacaula thanks for putting energy into this. If you think this - or a subset of it - is ready for production, please open another pull request. Otherwise, feel free to submit an issue for future work.

@mmacaula
Copy link
Contributor Author

@pjcozzi I think it's still a worthwhile improvement, tying up the event loop is a big detriment to users. This does need to be updated though. I've been on other projects, one issues this PR had was creating promises efficiently due to the promise library being fairly old. I heard you were changing it, is it updated now? I'd be happy to re-work so I can avoid the setTimeouts. What do you think @mramato ?

@mramato
Copy link
Contributor

mramato commented Jan 25, 2017

Yeah, I stink; sorry for not getting back to this sooner. We haven't updated promise libraries yet (though I would like to get on that soon, I started a branch but it got messy quick). As before, my main concern here is performance, I will take another look at this branch sometime this week or weekend and see if I was just being a curmudgeon the first time around. If so, we can take this as an incremental improvement until we have a better solution.

@mmacaula
Copy link
Contributor Author

My preference would be to bring in the setImmediate polyfill and convert the setTimeouts to use that to build the promises. Then we'd not really sacrifice performance while giving the thread back to the browser periodically.

@jeremywilliams
Copy link

Are any other updates going to be made to this PR?

@mramato
Copy link
Contributor

mramato commented May 30, 2017

@jeremywilliams I'm planning on revisiting this PR in June in the hopes of integrating it with the new JobScheduler we have in master. I'll update this PR with details when I know more.

@juburr
Copy link
Contributor

juburr commented Sep 1, 2017

@mramato Is this still in the works?

@cesium-concierge
Copy link

Thanks again for your contribution @mmacaula!

No one has commented on this pull request in 30 days. Maintainers, can you review, merge or close to keep things tidy?

I'm going to re-bump this in 30 days. If you'd like me to stop, just comment with @cesium-concierge stop. If you want me to start again, just delete the comment.


I am a bot who helps you make Cesium awesome! Contributions to my configuration are welcome.

🌍 🌎 🌏

@hpinkos
Copy link
Contributor

hpinkos commented Dec 12, 2018

@cesium-concierge stop

@OmarShehata
Copy link
Contributor

@mramato this is the issue I was referring to today when we were talking about vector data crashing Cesium. It sounds it's an issue of just not releasing the main thread as opposed to there just not being enough memory.

@hpinkos
Copy link
Contributor

hpinkos commented Sep 18, 2019

@tfili is going to try to look at this soon

@mramato
Copy link
Contributor

mramato commented Sep 24, 2019

Thanks again, this was replaced with #8195 and will go out with the Oct 1 release.

@mramato mramato closed this Sep 24, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

9 participants