Lua page script timeouts when trying to render binary pages #51
Comments
What happens? Is it because the timeout is not large enough to download the file, or is it because Splash doesn't handle non-HTML content in splash:go?
It's the latter - just plain
But it looks like not every kind of binary content causes splash:go to fail; I will try to narrow it down.
Splash doesn't currently handle unsupported content (see http://doc.qt.io/archives/qt-5.5/qwebpage.html#forwardUnsupportedContent-prop). To fix it, we need to add an API for that to Splash.
The link was extracted from
When the page is not an HTML page but binary content (we cannot know for sure when extracting links), the Lua script times out (even without HH enabled).
Not only do we fail to download such pages, but this also slows down the whole crawl considerably.
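Until Splash handles unsupported content itself, one possible client-side workaround is to check the Content-Type before sending a URL to be rendered. The sketch below is only an illustration and is not part of Splash: the names `RENDERABLE_TYPES`, `is_renderable` and `should_render` are hypothetical, the set of renderable types is an assumption, and it assumes the target server answers HEAD requests with an accurate Content-Type header, which is not always the case.

```python
from urllib.request import Request, urlopen

# Assumption: only these media types are worth sending to a browser-based renderer.
RENDERABLE_TYPES = {"text/html", "application/xhtml+xml"}


def is_renderable(content_type):
    """Return True if a Content-Type header value names an HTML-like media type."""
    if not content_type:
        # Missing header: treated as not renderable here (a conservative choice).
        return False
    # Drop parameters such as "; charset=utf-8" and normalise case.
    media_type = content_type.split(";", 1)[0].strip().lower()
    return media_type in RENDERABLE_TYPES


def should_render(url, timeout=10.0):
    """Issue a HEAD request and decide from Content-Type whether to render the URL."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=timeout) as response:
        return is_renderable(response.headers.get("Content-Type"))
```

This costs one extra request per link, so it trades some crawl throughput for avoiding the much longer script timeout on binary pages.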