Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DASH processManifest performance regression in 3.x #4062

Closed
philippe-elsass-deltatre opened this issue Mar 24, 2022 · 65 comments · Fixed by #4009
Closed

DASH processManifest performance regression in 3.x #4062

philippe-elsass-deltatre opened this issue Mar 24, 2022 · 65 comments · Fixed by #4009
Assignees
Labels
component: DASH The issue involves the MPEG DASH manifest format priority: P1 Big impact or workaround impractical; resolve before feature release status: archived Archived and locked; will not be updated type: bug Something isn't working correctly type: performance A performance issue
Milestone

Comments

@philippe-elsass-deltatre
Copy link

philippe-elsass-deltatre commented Mar 24, 2022

Have you read the FAQ and checked for duplicate open issues?
Yes

What version of Shaka Player are you using?
2.5.x, 3.x

Can you reproduce the issue with our latest release version?
Yes

Can you reproduce the issue with the latest code from main?
Didn't try

Are you using the demo app or your own custom app?
Custom

If custom app, can you reproduce the issue using our demo app?
N/A

What browser and OS are you using?
webOS, Tizen

For embedded devices (smart TVs, etc.), what model and firmware version are you using?
webOS 5.00.35
Tizen firmware 2200.9

What are the manifest and license server URIs?

What configuration are you using? What is the output of player.getConfiguration()?

Default configuration

What did you do?

Play a live stream with large DVR (several hours) using different versions of Shaka Player on webOS and Tizen TV (low performance devices)

What did you expect to happen?
Playback should be smooth

What actually happened?

Using Shaka Player 2.5.x, Shaka is able to load and process the growing live manifest every few seconds.

Using Shaka Player 3.x, the processManifest step takes several seconds.

With a 4+h DVR, every time the manifest is reloaded:

  • On Tizen, it takes close to 2s
  • On webOS, it takes over 10s

There is visibly a considerable performance degradation of the manifest parsing logic, making it barely tolerable on Tizen (UI is frozen during the parsing), and completely unusable on webOS.

@philippe-elsass-deltatre philippe-elsass-deltatre added the type: bug Something isn't working correctly label Mar 24, 2022
@github-actions github-actions bot added this to the v3.3 milestone Mar 24, 2022
@philippe-elsass-deltatre
Copy link
Author

I did some profiling and code reading and I strongly suspect a refactoring which happened between 2.5 and 3:

  • Most of the for loops have been converted to enumeration loops
  • Enumerations are using JS generators
  • Generators seem to be transpiled to ES5...

image

Example in MpdUtils.js:

// Shaka 2.5.x
for (let i=0; i < timePoints.length; ++i) {

// Shaka 3.x
const enumerate = (it) => shaka.util.Iterables.enumerate(it);
for (const {item: timePoint, next} of enumerate(timePoints)) {

This looks like an absolutely catastrophic refactoring idea.

@avelad
Copy link
Member

avelad commented Mar 24, 2022

@joeyparrish / @theodab , can you check this? If so, I think it should be changed to improve performance.

@avelad
Copy link
Member

avelad commented Mar 25, 2022

I have created #4064 to fix these issues.

@philippe-elsass-deltatre
Copy link
Author

Amazing @avelad - I was precisely doing these changes locally to see if it was fixing the issue :D

@avelad
Copy link
Member

avelad commented Mar 25, 2022

@philippe-elsass-deltatre Have you seen performance improvements with the fix?

@philippe-elsass-deltatre
Copy link
Author

@avelad when profiling with Chrome browser there is a 10x improvement at least.

@philippe-elsass-deltatre
Copy link
Author

However I'm seeing a worrying climbing of the memory

@avelad
Copy link
Member

avelad commented Mar 25, 2022

Have you seen any memory leak?

@philippe-elsass-deltatre
Copy link
Author

There seem to be a massive memory leak: even after unloading Shaka I see memory climbing and if I pause JS I end up in a setTimeout callback looking at variants/codecs/decodingInfos

@avelad
Copy link
Member

avelad commented Mar 25, 2022

And is it caused by this change or is it reproduced in the master branch?

@philippe-elsass-deltatre
Copy link
Author

I don't know that's odd - the player seem to be leaking if I build from source tag v3.3.2. Maybe I'm doing it wrong.

@philippe-elsass-deltatre
Copy link
Author

Is there something faster than all.py to build the library? build.py doesn't seem to produce the shaka-compiled.js

@philippe-elsass-deltatre
Copy link
Author

No I'm good, building from source is fine for v3.3.2 and main - doesn't seem to leak.

@philippe-elsass-deltatre
Copy link
Author

Ok finally your branch doesn't leak - I did have something wrong.

Performance-wise with release JS the branch is about 40% faster - I'll try that on device now.

@philippe-elsass-deltatre
Copy link
Author

philippe-elsass-deltatre commented Mar 25, 2022

So... I under-estimated the penalty of the debug JS and my alarming "10s" on LG was a lot due to being a debug build (I couldn't find where to add timings otherwise).

Still the branch brings is a major improvement on LG device.

@philippe-elsass-deltatre
Copy link
Author

Unfortunately Shaka 2.5.9 is still MUCH faster. Our LGs stutter with a live stream with 4h DVR with Shaka 3 (even optimised) while Shaka 2 handles it without a sweat.

@avelad
Copy link
Member

avelad commented Mar 25, 2022

The good thing is that the fix is good, although it does not improve everything that it should, but it would be necessary to see what other parts can be optimized.

@joeyparrish joeyparrish added type: performance A performance issue priority: P1 Big impact or workaround impractical; resolve before feature release component: DASH The issue involves the MPEG DASH manifest format labels Mar 25, 2022
@joeyparrish
Copy link
Member

Thank you, @avelad, for fixing this! Should we leave this issue open for further investigation, or should PR #4064 close it?

@avelad
Copy link
Member

avelad commented Mar 25, 2022

I think that PR is an improvement but this issue should keep open for more investigation

@joeyparrish
Copy link
Member

I have done some analysis and have concluded that ES6 generators, even when not transpiled to ES5, are monstrously slow. We use them in two methods of shaka.util.Iterables and in two places in lib/cea/.

Here's a comparison of three ways to iterate through a large array of 100k items:

  1. Using shaka.util.Iterables.enumerate as transpiled to ES5
  2. Using shaka.util.Iterables.enumerate in raw ES6
  3. Using a version of shaka.util.Iterables.enumerate in raw ES6 with the generator removed

The results are shocking. The first two methods are similar (159.8 runs / second vs 176.2 runs / second). So the compiler does not seem to add much in the way of overhead (around 10%).

But the third method, which is only two lines difference from the second, runs at 3141.2 runs / second, or 17.8x faster.

So we need to get rid of all generator usage and ban it at the compiler level.

To reproduce these results, open https://jsbench.me/jnl16wbdor/1 and click "Run" at the top. Keep the tab in the foreground, or else the results will be thrown off by throttling in the browser.

@joeyparrish
Copy link
Member

@philippe-elsass-deltatre, please try this build which removes all generators (and makes no other changes):

If that is a meaningful improvement in performance for you, we can pursue something like this as a solution. (Though this is just a drop-in, API compatible replacement of generators with returned Arrays. We may want to evaluate individual cases and make more changes like @avelad's.)

@philippe-elsass-deltatre
Copy link
Author

Nice - I've never been a big fan of generators but now I have an excuse :D

One thing I notice in Shaka 3 VS Shaka 2 is MUCH larger times spent in GC. An enumerate function returning Arrays is ok for simple cases, but generally causes more garbage to be generated. One code part I found suspicious (but happened to be the same in v2) was the MpdUtils.inheritAttribute which creates multiple arrays where an old-school if/else would create less garbage.

@joeyparrish
Copy link
Member

Looks like the release bot closed this issue when it shouldn't have.

@joeyparrish joeyparrish modified the milestones: v4.0, Backlog May 23, 2022
@joeyparrish joeyparrish self-assigned this May 23, 2022
@cristian-atehortua
Copy link
Contributor

cristian-atehortua commented Aug 12, 2022

Hi, just looking at this issue because we are facing a big rebuffering problem on webOS devices 3.x and 4.4.
Our live manifests have a 4H DVR window and because of that Audio tracks on it are represented by a SegmentTimeline with around 2500 Segments. That is because each audio segment's duration differs from the next.

I saw the function that is wasting more time to execute is MpdUtils.createTimeline and a big part of that time is what the system waste on getAtribute for each attribute of each segment.

I was thinking that a good approach to solve the issue is to make this processing on demand, only when the segment is about to be used. I know this implies a big refactor on the way SegmentIndex works but maybe you can take this as a reference to improve it. What do you think?

@joeyparrish
Copy link
Member

Thanks for the hint! Yes, I think you're right that this would be a big refactor to SegmentIndex. Perhaps we can find a new way to abstract SegmentReference to make this work. A parse-on-demand structure would require us to keep the XML elements in memory, but perhaps this would still be a win.

@avelad
Copy link
Member

avelad commented Feb 9, 2023

I think the problem has been mitigated with the latest versions. can you check it?

@avelad avelad added the status: waiting on response Waiting on a response from the reporter(s) of the issue label Feb 9, 2023
@cristian-atehortua
Copy link
Contributor

Hi @avelad, I didn't look to perform better in the latest versions.

We found that this has a special impact when a live manifest has more than one audio track. It looks like Shaka parses each AdaptationSet even if that audio track is not selected. Maybe a "quick" win could be to delay parsing of the audio tracks until they're selected. What do you think @joeyparrish?

@github-actions github-actions bot removed the status: waiting on response Waiting on a response from the reporter(s) of the issue label Feb 10, 2023
@theodab
Copy link
Contributor

theodab commented Feb 14, 2023

We already added support for delaying the parsing of HLS media playlists, so some of the work for variants not fully loading until chosen is already done.
I'm still not sure if the required changes to the DASH parser would be "quick", though, even with that. When we changed the HLS parser that way, we were finding new bugs with it for weeks.

@cristian-atehortua
Copy link
Contributor

Well, based on the workaround we applied I think delaying could be a practical solution.

To workaround on the issue we preprocess the manifest and we just keep a single audio track, deleting completely the AdaptationSets that don't match the user's language preference. This is a feature loss but we had to do it because if not, our streams will not be playable. I think we can avoid the feature loss if Shaka optimize the way it process the manifest.

Just to clarify, this happens with our live streams that has the representation with SegmentTimeline and only on WebOS v3 and v4 since this platforms looks to be very limited in resources.

@theodab
Copy link
Contributor

theodab commented Feb 14, 2023

Well, I don't think it will have memory savings as great as preprocessing the manifest by removing entire tracks. We will need to keep the stuff we haven't parsed in memory, after all, so it can be parsed later. But I think it's probably worth a try.

I have other higher-priority issues I need to get to first, though, so I won't be able to work on this right away.

@avelad
Copy link
Member

avelad commented Jul 5, 2023

I think the problem has been mitigated in the main branch (see f1c5a1c), can you check it? Thanks!

@avelad avelad added the status: waiting on response Waiting on a response from the reporter(s) of the issue label Jul 5, 2023
@github-actions
Copy link
Contributor

Closing due to inactivity. If this is still an issue for you or if you have further questions, the OP can ask shaka-bot to reopen it by including @shaka-bot reopen in a comment.

@github-actions github-actions bot removed the status: waiting on response Waiting on a response from the reporter(s) of the issue label Jul 12, 2023
@avelad avelad modified the milestones: Backlog, v4.4 Aug 28, 2023
@shaka-bot shaka-bot added the status: archived Archived and locked; will not be updated label Sep 10, 2023
@shaka-project shaka-project locked as resolved and limited conversation to collaborators Sep 10, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
component: DASH The issue involves the MPEG DASH manifest format priority: P1 Big impact or workaround impractical; resolve before feature release status: archived Archived and locked; will not be updated type: bug Something isn't working correctly type: performance A performance issue
Projects
None yet
Development

Successfully merging a pull request may close this issue.

6 participants