Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问一下爬取评论是按照时间顺序还是热度的顺序爬取的? #533

Open
Alanhahaer opened this issue Jan 8, 2025 · 4 comments

Comments

@Alanhahaer
Copy link

5206aabe62ce22f8ceabee48b3a4939c ![ffcaab64589693d99e73fa12fbfffb1c](https://github.com/user-attachments/assets/d0fc3b53-47f2-48a5-bc40-96ba1f69f419) 想请教下,假设我爬取500条,这500条是最新的500条评论,还是平台按照热度或者别的指标排在前面的500条?
@Alanhahaer
Copy link
Author

ffcaab64589693d99e73fa12fbfffb1c

@NanmiCoder
Copy link
Owner

评论没发控制结果的排序,平台没有提供。

@Alanhahaer
Copy link
Author

噢噢这样,谢谢。还有想问一下,能够设置爬取所有一级评论吗?还是说想要所有子评论的话,设置一个比较大的数就可以?

@qicaiyun
Copy link

噢噢这样,谢谢。还有想问一下,能够设置爬取所有一级评论吗?还是说想要所有子评论的话,设置一个比较大的数就可以?

同问这个问题,遇到个评论只有9k,我设置的max是2w,结果采集出来1.6w多条评论,里面很多重复的

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants