DASH processManifest performance regression in 3.x #4062
I did some profiling and code reading, and I strongly suspect a refactoring that happened between 2.5 and 3. Example:

```js
// Shaka 2.5.x
for (let i = 0; i < timePoints.length; ++i) {

// Shaka 3.x
const enumerate = (it) => shaka.util.Iterables.enumerate(it);
for (const {item: timePoint, next} of enumerate(timePoints)) {
```

This looks like an absolutely catastrophic refactoring idea.
@joeyparrish / @theodab, can you check this? If so, I think it should be changed to improve performance.
I have created #4064 to fix these issues.
Amazing @avelad - I was precisely making these changes locally to see if they would fix the issue :D
@philippe-elsass-deltatre Have you seen performance improvements with the fix?
@avelad when profiling with the Chrome browser there is at least a 10x improvement.
However, I'm seeing a worrying climb in memory usage.
Have you seen any memory leak?
There seems to be a massive memory leak: even after unloading Shaka I see memory climbing, and if I pause JS I end up in a
And is it caused by this change, or is it reproduced in the master branch?
I don't know, that's odd - the player seems to be leaking if I build from the source tag v3.3.2. Maybe I'm doing it wrong.
Is there something faster than
No, I'm good, building from source is fine for
Ok, finally your branch doesn't leak - I did have something wrong. Performance-wise, with release JS the branch is about 40% faster - I'll try that on device now.
So... I underestimated the penalty of the debug JS, and my alarming "10s" on LG was largely due to being a debug build (I couldn't find where to add timings otherwise). Still, the branch brings a major improvement on the LG device.
Unfortunately, Shaka 2.5.9 is still MUCH faster. Our LGs stutter with a live stream with a 4h DVR on Shaka 3 (even optimised), while Shaka 2 handles it without breaking a sweat.
The good news is that the fix helps, although it does not improve everything it should; we still need to see what other parts can be optimized.
I think that PR is an improvement, but this issue should stay open for more investigation.
I have done some analysis and have concluded that ES6 generators, even when not transpiled to ES5, are monstrously slow. We use them in two methods. Here's a comparison of three ways to iterate through a large array of 100k items:
The results are shocking. The first two methods are similar (159.8 runs/second vs 176.2 runs/second), so the compiler does not seem to add much overhead (around 10%). But the third method, which differs from the second by only two lines, runs at 3141.2 runs/second, or 17.8x faster. So we need to get rid of all generator usage and ban it at the compiler level. To reproduce these results, open https://jsbench.me/jnl16wbdor/1 and click "Run" at the top. Keep the tab in the foreground, or else the results will be thrown off by throttling in the browser.
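The hosted benchmark page may not survive; a minimal standalone sketch of the comparison is below. The `enumerate` generator here is an assumption modeled loosely on `shaka.util.Iterables.enumerate`, not Shaka's actual implementation, and the timing loop is rough.

```js
// Sketch of the generator-vs-loop benchmark described above.
// enumerate() is illustrative, not Shaka's real code.
function* enumerate(iterable) {
  let i = 0;
  for (const item of iterable) {
    yield {i: i++, item};
  }
}

// Method A: iterate via the generator.
function sumWithGenerator(arr) {
  let sum = 0;
  for (const {item} of enumerate(arr)) {
    sum += item;
  }
  return sum;
}

// Method B: plain index loop, only a couple of lines different
// at the call site.
function sumWithIndexLoop(arr) {
  let sum = 0;
  for (let i = 0; i < arr.length; ++i) {
    sum += arr[i];
  }
  return sum;
}

const data = Array.from({length: 100000}, (_, i) => i);

// Rough wall-clock timing; use a real benchmark harness for stable numbers.
const time = (fn) => {
  const start = Date.now();
  for (let n = 0; n < 100; ++n) fn(data);
  return Date.now() - start;
};

console.log('generator:', time(sumWithGenerator), 'ms');
console.log('index loop:', time(sumWithIndexLoop), 'ms');
```

On constrained TV runtimes the gap should be even larger than on a desktop browser, since each generator step allocates a result object that the GC must later collect.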
@philippe-elsass-deltatre, please try this build which removes all generators (and makes no other changes):
If that is a meaningful improvement in performance for you, we can pursue something like this as a solution. (Though this is just a drop-in, API-compatible replacement of generators with returned Arrays. We may want to evaluate individual cases and make more changes like @avelad's.)
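The drop-in, API-compatible conversion mentioned above can be illustrated like this; `parseDescriptors` and `parseOne` are hypothetical names for this sketch, not Shaka's actual methods.

```js
// Hypothetical before/after of removing a generator. parseOne() and the
// element list stand in for real parsing code; only the iteration
// strategy is the point here.
const elements = ['a', 'b', 'c'];
const parseOne = (el) => el.toUpperCase();

// Before: a generator, lazy but slow on constrained devices.
function* parseDescriptorsGen() {
  for (const el of elements) {
    yield parseOne(el);
  }
}

// After: the method eagerly returns an Array instead of a generator
// object, but call sites using for..of or spread syntax are unaffected.
function parseDescriptors() {
  const result = [];
  for (const el of elements) {
    result.push(parseOne(el));
  }
  return result;
}
```

Because both a generator object and an Array are iterable, callers written as `for (const d of parseDescriptors())` keep working, which is what makes the swap API-compatible.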
Nice - I've never been a big fan of generators, but now I have an excuse :D One thing I notice in Shaka 3 vs Shaka 2 is MUCH larger times spent in GC.
Looks like the release bot closed this issue when it shouldn't have.
Hi, just looking at this issue because we are facing a big rebuffering problem on webOS devices 3.x and 4.4. I profiled and saw which function wastes the most time to execute. I was thinking that a good approach to solve the issue is to do this processing on demand, only when the segment is about to be used. I know this implies a big refactor of the way SegmentIndex works, but maybe you can take this as a reference to improve it. What do you think?
Thanks for the hint! Yes, I think you're right that this would be a big refactor to SegmentIndex. Perhaps we can find a new way to abstract SegmentReference to make this work. A parse-on-demand structure would require us to keep the XML elements in memory, but perhaps this would still be a win.
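A parse-on-demand structure along the lines discussed could look like this sketch. The class and field names are illustrative, not Shaka's SegmentIndex API, and SegmentTimeline repeat counts (`@r`) and full `@timescale` handling are omitted for brevity.

```js
// Illustrative sketch of parse-on-demand segment references. Raw
// SegmentTimeline entries ({t: start, d: duration} in timescale units)
// stay in memory; reference objects are built lazily and cached, so the
// cost is paid per segment actually used instead of per manifest reload.
class LazySegmentIndex {
  constructor(rawEntries, timescale) {
    this.rawEntries_ = rawEntries;
    this.timescale_ = timescale;
    this.cache_ = new Map();
  }

  // Build (or fetch from cache) the reference at the given position.
  get(position) {
    let ref = this.cache_.get(position);
    if (!ref) {
      const {t, d} = this.rawEntries_[position];
      ref = {
        startTime: t / this.timescale_,
        endTime: (t + d) / this.timescale_,
      };
      this.cache_.set(position, ref);
    }
    return ref;
  }
}

// Two one-second segments at a 90kHz timescale.
const index = new LazySegmentIndex(
    [{t: 0, d: 90000}, {t: 90000, d: 90000}], 90000);
```

The trade-off is exactly the one noted above: the raw entries (or XML elements) must be kept in memory so they can be parsed later.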
I think the problem has been mitigated in the latest versions. Can you check it?
Hi @avelad, it didn't look like it performs better in the latest versions. We found that this has a special impact when a live manifest has more than one audio track. It looks like Shaka parses each AdaptationSet even if that audio track is not selected. Maybe a "quick" win could be to delay parsing of the audio tracks until they're selected. What do you think @joeyparrish?
We already added support for delaying the parsing of HLS media playlists, so some of the work for variants not fully loading until chosen is already done.
Well, based on the workaround we applied, I think delaying could be a practical solution. To work around the issue we preprocess the manifest and keep only a single audio track, completely deleting the AdaptationSets that don't match the user's language preference. This is a feature loss, but we had to do it because otherwise our streams would not be playable. I think we can avoid the feature loss if Shaka optimizes the way it processes the manifest. Just to clarify, this happens with our live streams that have representations with SegmentTimeline, and only on webOS v3 and v4, since these platforms seem to be very limited in resources.
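The preprocessing workaround described above can be sketched roughly as follows. This is a naive regex-based filter for illustration only (the function name is made up, and regex-on-XML is fragile); real code should use an XML parser and handle nested elements and richer `lang` matching.

```js
// Naive sketch of the workaround: drop audio AdaptationSets whose @lang
// does not match the user's preference, before handing the manifest to
// the player. Assumes non-nested AdaptationSet elements.
function keepPreferredAudio(mpdText, preferredLang) {
  return mpdText.replace(
      /<AdaptationSet\b[^>]*>[\s\S]*?<\/AdaptationSet>/g,
      (set) => {
        const isAudio = /contentType="audio"|mimeType="audio\//.test(set);
        if (!isAudio) return set;  // Keep video, text, etc.
        const lang = (set.match(/lang="([^"]*)"/) || [])[1];
        return lang === preferredLang ? set : '';
      });
}

const mpd = [
  '<AdaptationSet contentType="audio" lang="en"><Representation/></AdaptationSet>',
  '<AdaptationSet contentType="audio" lang="fr"><Representation/></AdaptationSet>',
  '<AdaptationSet contentType="video"><Representation/></AdaptationSet>',
].join('');

const filtered = keepPreferredAudio(mpd, 'en');
```

In a player integration, a filter like this would run on the manifest response before parsing, which is why it trades away alternate-language tracks for parse time.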
Well, I don't think it will have memory savings as great as preprocessing the manifest by removing entire tracks. We will need to keep the stuff we haven't parsed in memory, after all, so it can be parsed later. But I think it's probably worth a try. I have other higher-priority issues I need to get to first, though, so I won't be able to work on this right away.
I think the problem has been mitigated in the main branch (see f1c5a1c), can you check it? Thanks!
Closing due to inactivity. If this is still an issue for you or if you have further questions, the OP can ask shaka-bot to reopen it by including
Have you read the FAQ and checked for duplicate open issues?
Yes
What version of Shaka Player are you using?
2.5.x, 3.x
Can you reproduce the issue with our latest release version?
Yes
Can you reproduce the issue with the latest code from main?
Didn't try
Are you using the demo app or your own custom app?
Custom
If custom app, can you reproduce the issue using our demo app?
N/A
What browser and OS are you using?
webOS, Tizen
For embedded devices (smart TVs, etc.), what model and firmware version are you using?
webOS 5.00.35
Tizen firmware 2200.9
What are the manifest and license server URIs?
What configuration are you using? What is the output of player.getConfiguration()?
Default configuration
What did you do?
Play a live stream with large DVR (several hours) using different versions of Shaka Player on webOS and Tizen TV (low performance devices)
What did you expect to happen?
Playback should be smooth
What actually happened?
Using Shaka Player 2.5.x, Shaka is able to load and process the growing live manifest every few seconds.
Using Shaka Player 3.x, the processManifest step takes several seconds. With a 4+h DVR, every time the manifest is reloaded there is a considerable performance degradation in the manifest parsing logic, making playback barely tolerable on Tizen (the UI freezes during parsing) and completely unusable on webOS.