What's changed
New features
- Now if you used the
wait_selector
argument withStealthyFetcher
andPlayWrightFetcher
classes, Scrapling will wait again for the JS to fully load and execute like normal. If you used thenetwork_idle
argument, Scrapling will wait for it again too after waiting for all of that. If the states are all fulfilled then no waiting happens, of course. - Now you can enable and disable ads on
StealthyFetcher
with thedisable_ads
argument. This is enabled by default and it installs theublock origin
addon. - Now you can set the locale used by
PlayWrightFetcher
with thelocale
argument. The default value is stillen-US
. - Now the basic requests done through
Fetcher
can accept proxies in this formathttp://username:password@localhost:8030
. - The stealth mode improved a bit for
PlayWrightFetcher
.
Bugs Squashed/Improvements
- Now enabling proxies on the
PlayWrightFetcher
class is not tied to thestealth
mode being on or off (Thanks to @AbdullahY36 for pointing that out) - Now the
ResponseEncoding
tests if the encoding returned from the response can be used with the page or not. If the returned encoding triggered an error, Scrapling defaults toutf-8
Note
A friendly reminder that maintaining and improving Scrapling
takes a lot of time and effort which I have been happily doing for months even though it's becoming harder. So, if you like Scrapling
and want it to keep improving, you can help by supporting me through the Sponsor button.