CodePointInversionList JSON serialization cannot represent all code points #3892
Labels
C-unicode
Component: Props, sets, tries
help wanted
Issue needs an assignee
T-bug
Type: Bad behavior, security, privacy
CPIL cannot JSON-serialize a CPIL such as
[0-\uDFFF]
, because the end of the range is not a valid Rustchar
. We could check while serializing if such code points exist, and if so, fall back to theOldStyle
serialization for human-readable. We could also add escaping support toNewStyle
.This is an issue for transform rules such as InterIndic-Arabic that use sets like
$nonword = [^\uE000-\uE0FF];
The text was updated successfully, but these errors were encountered: