Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v0.2.6 #16

Merged
merged 9 commits into from
Nov 24, 2024
Merged

v0.2.6 #16

merged 9 commits into from
Nov 24, 2024

Conversation

D4Vinci
Copy link
Owner

@D4Vinci D4Vinci commented Nov 24, 2024

What's changed

New features

  • Now the PlayWrightFetcher can use the real browser directly with the real_chrome argument passed to the PlayWrightFetcher.fetch function but this requires you to have Chrome browser installed. Scrapling will launch an instance of your Chrome browser and you can use most of the options as normal. (Before you only had the cdp_url argument to do so)
  • Pumped up the version of headers generated for real browsers.

Bugs Squashed

  1. Turns out the format of the browser headers generated by BrowserForge was outdated which made Scrapling detected by some protections so now BrowserForge is only used to generate real useragent.
  2. Now the hide_canvas argument is turned off by default as it's being detected by Google's ReCaptcha.

Note

A friendly reminder that maintaining and improving Scrapling takes a lot of time and effort which I have been happily doing for months even though it's becoming harder. So, if you like Scrapling and want it to keep improving, you can help by supporting me through the Sponsor button.

@D4Vinci D4Vinci merged commit bbbc97a into main Nov 24, 2024
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant