-
Notifications
You must be signed in to change notification settings - Fork 496
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
dataset.xhtml and user input of special characters #8018
Comments
FWIW: Don found #4442 which looks like the same (but closed) issue. |
To reproduce on a Mac: Edit -> Emoji & Symbols, Customize List, enable Unicode, double-click |
Today we saw a case of |
This morning we removed a |
|
I can easily replicate the issue with a "big list of naughty strings" file I found at https://github.com/minimaxir/big-list-of-naughty-strings Steps:
Same XML error as above: XML Parsing Error: not well-formed Screenshot: |
I've been hacking around with a dataset with form feed ( This isn't a solution, of course, but I thought I'd mentioned it. I'm moving on to detecting special characters and providing a way to remove them.
@scolapasta helped me find the above by digging around in Chrome devtools. Some screenshots: |
Per standup the plan is something like this:
V4.11.0.1__5565-sanitize-directory-labels.sql seems to have a good example to start with:
|
Having just been bitten, a pre-emptive note - presumably the release notes on this would need to recommend re-index as well to get bad values out of solr? |
remove invalid characters from input #8018
dataset.xhtml in Dataverse-5.5 allows any old user to insert special characters into dataset metadata on dataset creation, which render the dataset metadata un-editable thereafter. On our instance nothing is written to server.log; instead the browser console conveys the rendering complaint when the user attempts to edit metadata. Here's a sample broken dataset on demo.dataverse.org:
Tough to say which UTF characters render the XML invalid, but it would be nice to sanitize them on user input.
The text was updated successfully, but these errors were encountered: