-
-
Notifications
You must be signed in to change notification settings - Fork 217
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proper encoding of load_url #658
Comments
Referencing my previous issue #656 here. That issue concerned encoding of the url parameter in the query string of the request between the frontend and the backend. This issue concerns encoding of the same parameter in the request between the backend and the warcserver (as configured via the replay_url). |
Just to confirm, this was for use with OutbackCDX, right? Or some other configuration? |
We use our own indexer and loader backend https://github.com/nlnwa/gowarcserver. |
Our config looks something like:
|
Formatting of load_url does not encode the url parameter properly if it ends up in the query string of the configured url_field (replay_url):
pywb/pywb/warcserver/index/indexsource.py
Lines 160 to 162 in 843fe28
Some url's does not survive query parameter parsing unscaded when the url parameter is part of the query string of the load_url.
This seems to fix the issue:
I believe this is a proper fix without breaking changes, but I am not sure. Shall I post a PR?
The text was updated successfully, but these errors were encountered: