Add support for channel tabs #951

Theta-Dev · 2022-10-23T22:55:19Z

I carefully read the contribution guidelines and agree to them.
I have tested the API against NewPipe.
I agree to create a pull request for NewPipe as soon as possible to make it compatible with the changed API.

YouTube is planning to update their channel page layout (as announced here) and divide videos into 3 tabs (regular videos, shorts, livestreams).

I opened pull request #944 to address the updated video tab layout, but that would still leave shorts and livestreams inaccessible. So I decided to implement channel tabs for NewPipe, which allow the extraction of additional content besides the main video list.

Channel tabs may contain any InfoItem, which makes it possible to not only extract video tabs but also playlists and featured channels. So this also resolves, for the extractor part, TeamNewPipe/NewPipe#2414.

I added support for the following services and tabs:

YouTube: Shorts, Livestreams, Playlists, Channels, About
Soundcloud: Playlists, Albums
Peertube: Channels, Playlists
Bandcamp: Albums

Closes #227

B0pol · 2022-11-02T13:16:33Z

Are YouTube Music / topic channels supported? #501
We couldn't support them because there was no tab support, and such a channel is basically one with only a playlist tab.

Stypox

Looks good to me. Now we just need to wait for AudricV's review and we are done. Thanks!

Theta-Dev · 2023-04-25T20:31:24Z

Having the same URL for different tabs will lead to cache conflicts. That's why I added the URL suffixes, even though they are not present on the actual Bandcamp page.
They redirect to the main page, though.

/**
 * Check if we can load it from the cache (forceLoad parameter), if we can't,
 * load from the network (Single loadFromNetwork)
 * and put the results in the cache.
 *
 * @param <I>             the item type's class that extends {@link Info}
 * @param forceLoad       whether to force loading from the network instead of from the cache
 * @param serviceId       the service to load from
 * @param url             the URL to load
 * @param infoType        the {@link InfoItem.InfoType} of the item
 * @param loadFromNetwork the {@link Single} to load the item from the network
 * @return a {@link Single} that loads the item
 */
private static <I extends Info> Single<I> checkCache(final boolean forceLoad,
                                                     final int serviceId, final String url,
                                                     final InfoItem.InfoType infoType,
                                                     final Single<I> loadFromNetwork) {
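
The cache conflict described above can be illustrated with a minimal sketch. It is not NewPipe's actual cache: it only assumes, as the `checkCache` parameters suggest, that entries are looked up by service ID, URL, and info type. The `TabCacheSketch` class, the `"CHANNEL"` type string, the service ID `0`, and the example URLs are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: a cache keyed by (service ID, info type, URL).
// This is an illustration, not NewPipe's real cache implementation.
final class TabCacheSketch {
    private final Map<String, String> cache = new HashMap<>();

    // Assumed key shape: all three lookup parameters joined into one string.
    private static String key(final int serviceId, final String url,
                              final String infoType) {
        return serviceId + "|" + infoType + "|" + url;
    }

    void put(final int serviceId, final String url, final String infoType,
             final String content) {
        cache.put(key(serviceId, url, infoType), content);
    }

    String get(final int serviceId, final String url, final String infoType) {
        return cache.get(key(serviceId, url, infoType));
    }

    public static void main(final String[] args) {
        final String base = "https://example.bandcamp.com";

        // Both tabs share one URL, so the second entry overwrites the first.
        final TabCacheSketch sameUrl = new TabCacheSketch();
        sameUrl.put(0, base, "CHANNEL", "tracks tab");
        sameUrl.put(0, base, "CHANNEL", "albums tab");
        System.out.println(sameUrl.get(0, base, "CHANNEL"));

        // Distinct URL suffixes give each tab its own cache entry.
        final TabCacheSketch suffixed = new TabCacheSketch();
        suffixed.put(0, base + "/track", "CHANNEL", "tracks tab");
        suffixed.put(0, base + "/album", "CHANNEL", "albums tab");
        System.out.println(suffixed.get(0, base + "/track", "CHANNEL"));
    }
}
```

Under these assumptions, the first lookup returns the albums tab even though the tracks tab was stored first, while the suffixed URLs keep both entries apart.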

Stypox · 2023-04-26T13:47:42Z

Mmmh, you are right, but then the two URLs should be ".../tracks" and ".../albums", for consistency. Can you make this change and fix the merge conflicts?

Theta-Dev · 2023-04-27T09:46:12Z

I changed the URL suffixes to track and album, as these both redirect to the main page instead of leading to a 404 error.

AudricV

First of all, before reading the review, don't be afraid: I am not asking you to change everything yourself; I can make some of the requested changes if you want.
As some code changes have been made by @Stypox, they should reply to the comments on their changes, so you probably don't need to answer every comment.
Note that a lot of review comments ask for code style or formatting changes, so the changes are easier than they may seem at first glance.

Thank you very much for your effort on this feature! The general structure and code changes look fine with the current extractor code, but improvements are definitely needed in the future (which is partly what #904 does).

We need a list of API changes which will be added in the next extractor release, especially because this PR introduces breaking changes. It should be added to the PR description for easier access.

I noticed changes that should be made globally/in multiple files (some changes are not listed here but in review comments):

- better documentation/addition of documentation on methods and classes, at least for the ones exposed in the API;
- proper/better usage of Java's Stream API (I also highlighted this in a few review comments): when streaming a JsonArray, in most cases you should first keep only the elements that are instances of JsonObject, cast them once, and then work with these JsonObjects, instead of repeating the same process, or parts of it, multiple times (e.g. multiple casts). Here is an example:
```
myJsonArray.stream()
    .filter(JsonObject.class::isInstance)
    .map(JsonObject.class::cast)
    // Your operations here
```
- on several tests, you are removing @Override annotations and replacing them with @Test ones. Can you explain why you are doing this?
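
The filter-then-cast pattern from the review can be tried out with the following self-contained sketch. To avoid a dependency on the JSON library, a plain `List<Object>` and `Map` stand in for `JsonArray` and `JsonObject`; the `JsonStreamSketch` class, the `titles` method, and the `"title"` key are hypothetical names used only for illustration.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical sketch: List<Object> and Map stand in for JsonArray and JsonObject.
final class JsonStreamSketch {
    // Collects the "title" of every element that is a Map, checking the type
    // and casting each element exactly once.
    static List<String> titles(final List<Object> items) {
        return items.stream()
                .filter(Map.class::isInstance)    // keep only object-like elements
                .map(Map.class::cast)             // cast once, up front
                .map(object -> object.get("title"))
                .filter(String.class::isInstance) // skip missing or non-string titles
                .map(String.class::cast)
                .collect(Collectors.toList());
    }

    public static void main(final String[] args) {
        final Map<String, Object> video = new LinkedHashMap<>();
        video.put("title", "First video");
        final Map<String, Object> untitled = new LinkedHashMap<>();

        final List<Object> items = Arrays.asList(video, "not an object", 42, untitled, null);
        System.out.println(titles(items)); // prints [First video]
    }
}
```

Elements of the wrong type, including `null`, are dropped by the `isInstance` filter before any cast happens, so no `ClassCastException` can occur in the later steps.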

Finally, it seems that some commits have a message that does not match what they do. Could you fix that and also, if possible, switch to a rebase on our dev branch instead of using a lot of merge commits and fixing merge conflicts on the fly? It would make the commits of your branch, and so this PR, a lot cleaner in my opinion. Thank you in advance!

AudricV · 2022-11-09T19:06:15Z

extractor/src/main/java/org/schabi/newpipe/extractor/linkhandler/ChannelTabs.java

@@ -0,0 +1,12 @@
+package org.schabi.newpipe.extractor.linkhandler;
+
+public final class ChannelTabs {


You are using strings for channel tab names; however, this opens the door to several NullPointerExceptions in your code. You should make sure that they are avoided as much as possible. Using an enum would have been a great idea, as described in the conversation beyond #951 (comment), but as it has been decided not to do so, don't change anything here.

I think a dedicated base (abstract) structure should be created for tabs, and this structure should be implemented in services, because different services may have different tab names for the same type of contents.

This structure should allow getting its name, what it contains (streams, playlists, channels, [...] or multiple types of content) and maybe the service ID and/or more.

But as you rely on content and sort filters, which are currently strings, this is out of the scope of this PR. These filters should be refactored too (done in #904).
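
The base structure suggested above might look like the following minimal sketch. It is purely illustrative and not part of this PR or of NewPipeExtractor: `Tab`, `ContentType`, `ExampleAlbumsTab`, and the service ID value are hypothetical names chosen for the example.

```java
import java.util.Collections;
import java.util.EnumSet;
import java.util.Set;

// Hypothetical sketch of a base tab structure; none of these types exist in
// NewPipeExtractor.
final class ChannelTabSketch {
    // Kinds of content a tab may contain (assumed set, for illustration only).
    enum ContentType { STREAMS, PLAYLISTS, CHANNELS }

    // Base structure that each service would implement with its own tab names.
    abstract static class Tab {
        abstract String getName();

        abstract Set<ContentType> getContentTypes();

        abstract int getServiceId();
    }

    // Example implementation for a service whose tab is called "Albums".
    static final class ExampleAlbumsTab extends Tab {
        @Override
        String getName() {
            return "Albums";
        }

        @Override
        Set<ContentType> getContentTypes() {
            return Collections.unmodifiableSet(EnumSet.of(ContentType.PLAYLISTS));
        }

        @Override
        int getServiceId() {
            return 0; // placeholder service ID
        }
    }

    public static void main(final String[] args) {
        final Tab tab = new ExampleAlbumsTab();
        System.out.println(tab.getName() + " contains " + tab.getContentTypes());
    }
}
```

With such a structure, a service decides its own tab names while callers rely on the content types rather than on string comparisons.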

...org/schabi/newpipe/extractor/services/youtube/extractors/YoutubeStreamInfoItemExtractor.java

extractor/src/main/java/org/schabi/newpipe/extractor/channel/ChannelTabExtractor.java

extractor/src/main/java/org/schabi/newpipe/extractor/channel/ChannelTabInfo.java

...rg/schabi/newpipe/extractor/services/bandcamp/extractors/BandcampAlbumInfoItemExtractor.java

...ava/org/schabi/newpipe/extractor/services/youtube/extractors/YoutubeChannelTabExtractor.java

...c/test/java/org/schabi/newpipe/extractor/services/bandcamp/BandcampChannelExtractorTest.java

...c/test/java/org/schabi/newpipe/extractor/services/peertube/PeertubeAccountExtractorTest.java

.../test/java/org/schabi/newpipe/extractor/services/youtube/YouTubeChannelTabExtractorTest.java

Theta-Dev · 2023-05-01T14:42:18Z

on several tests, you are removing @Override annotations and replacing them with @Test ones. Can you explain why you are doing this?

Tests do not seem to run when they are not annotated with @Test. I could add both annotations, though.

extractor/src/main/java/org/schabi/newpipe/extractor/channel/ChannelInfo.java

FireMasterK

One thing I don't like about the YouTube implementation is that we fetch the videos tab, but we don't extract the StreamInfoItem contents from it, just the tabs.

This requires 2 (potentially the same) requests now to get the video tab information, which makes things a lot slower for Piped :/

Could we somehow have something like a default tab extracted if the service needs to extract a tab in order to get the available tabs?
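
One way to read this suggestion is sketched below. It is a hypothetical illustration, not the extractor's implementation: the `DefaultTabSketch` class, its methods, the tab names, and the request counter are all assumptions. The idea is that the page fetched to discover the tab list is kept and reused when the default tab is requested.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: reuse the page fetched for the tab list as the default tab.
final class DefaultTabSketch {
    int requestCount = 0;            // counts simulated network requests
    private String cachedVideosPage; // page kept from the tab-list request

    // Stand-in for a network request.
    private String fetch(final String tab) {
        requestCount++;
        return "page for " + tab;
    }

    // Assumption: the list of tabs can only be read from the videos page.
    List<String> getTabs() {
        cachedVideosPage = fetch("videos");
        return Arrays.asList("videos", "shorts", "livestreams");
    }

    String getTabPage(final String tab) {
        if ("videos".equals(tab) && cachedVideosPage != null) {
            return cachedVideosPage; // no second request for the default tab
        }
        return fetch(tab);
    }

    public static void main(final String[] args) {
        final DefaultTabSketch channel = new DefaultTabSketch();
        channel.getTabs();
        channel.getTabPage("videos");
        System.out.println("requests so far: " + channel.requestCount);
    }
}
```

In this sketch, listing the tabs and then opening the videos tab costs one request instead of two; other tabs still need their own request.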

AudricV · 2023-06-29T12:51:08Z

Please do not push any commits to the branch of this pull request: I am working on rebasing this branch on top of the dev branch and applying the remaining requested changes. Thanks!

AudricV · 2023-07-19T19:45:36Z

Closing in favor of #1082. Thank you very much for your work, which you can find in that new, improved, and updated version of this PR!

Theta-Dev added 10 commits October 12, 2022 15:29

fix: support richGridRenderer on channel page

ed4559d

YouTube is currently A/B testing a new layout on their channel pages, which uses a RichGridRenderer.

feat: add tab support to channel extractor

8b4b431

- extract YouTube channel tabs: playlists, channels, shorts, live

feat: add channel tabs

18e3758

fix: handle unsupported content, hide tab bar with < 2 tabs

78bbbd4

feat: prettier channel info page

9a9fae9

feat: add album tab

667ab2a

feat: add visitor data config option

57865e2

feat: add tab support for Peertube

aed685e

feat: add tab support for Soundcloud

53e772c

- fix checkstyle errors

fix: Peertube playlist urls

e6907ca

Theta-Dev mentioned this pull request Oct 23, 2022

Add support for channel tabs TeamNewPipe/NewPipe#9182

Merged

2 tasks

Merge branch 'dev' of github.com:TeamNewPipe/NewPipeExtractor into channel-tabs

04c7e46

TobiGr added the enhancement and multiservice labels Oct 24, 2022

Theta-Dev added 3 commits October 24, 2022 10:29

fix: checkstyle errors

edaaaac

fix: store YouTube visitor data for channel tabs

1253773

feat: add Bandcamp album tab

94523ad

opusforlife2 mentioned this pull request Oct 25, 2022

[HELP NEEDED] Major planned/missing features TeamNewPipe/NewPipe#6448

Open

34 tasks

test: add channel tab extractor tests

a592c96

Theta-Dev added 10 commits November 2, 2022 19:07

fix: change playlist tab parameter to include YTM albums

f3b064a

fix: NPE when extracting YT stream items without duration

0a458d8

fix: channel shorts duration parsing

856584f

1 minute was incorrectly parsed as 1s

fix: channel short upload date parsing

7ec6a44

refactor: API changes

f71fdac

Merge branch 'dev' of github.com:TeamNewPipe/NewPipeExtractor into channel-tabs

73c182f

fix: support new PlaylistInfoItem interface

abf0473

fix: rename channel tab LIVE to LIVESTREAMS

8a3545c

fix: link handler urls for tabs

7dba12b

fix: update mock data

f6d8652

Theta-Dev and others added 4 commits April 16, 2023 15:46

feat: fetch YT Shorts using internal playlist

6b627f8

refactor: merge YoutubeChannelTabExtractor and YoutubeChannelVideosTabExtractor

2ad496f

fix: remove overridden getId function in PeertubeAccountExtractor

0c5fdac

[Bandcamp] Use same url for tracks and albums channel tabs

6a38811

Stypox approved these changes Apr 25, 2023

Theta-Dev added 3 commits April 27, 2023 11:47

fix: add Bandcamp URL suffixes

d47d0f9

Merge branch 'dev' of github.com:TeamNewPipe/NewPipeExtractor into channel-tabs

417b797

tests: add tests for channel tab urls

0e28f2b

AudricV requested changes Apr 29, 2023

Stypox mentioned this pull request May 1, 2023

Parsing playlists for a channel #1057

Closed

3 tasks

Theta-Dev added 8 commits May 1, 2023 17:38

tests: separate channel/tab tests for Peertube, Bandcamp, Soundcloud

0583515

fix: Bandcamp channel link handler factory

tests: add @Override to YT channel/tab tests

a3f6a7e

Merge branch 'dev' of github.com:TeamNewPipe/NewPipeExtractor into channel-tabs

d868746

fix: use assertTabs method, rename channelTab mock folder

2adc2ca

fix: add TeamNewPipe#1050 fix to channel tab name extraction

e8fab3b

use shared method for channel header extraction

docs: add docs to ChannelTabInfo

b1f8905

refactor: remove getTab() method from ChannelTabExtractor

66d8038

fix: improve shorts duration parser

6c5a225

Theta-Dev requested a review from AudricV May 7, 2023 23:19

FireMasterK reviewed May 14, 2023

extractor/src/main/java/org/schabi/newpipe/extractor/channel/ChannelInfo.java

FireMasterK reviewed May 14, 2023

AudricV mentioned this pull request Jul 19, 2023

Add support for channel tabs and channel tags #1082

Merged

3 tasks

AudricV closed this Jul 19, 2023

AudricV removed their request for review July 19, 2023 19:45
