Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Enable regular expression support based on whether UTF-8 is in the cu…
…rrent locale (#5776) * Regular expression support handling via UTF-8 in the locale Signed-off-by: Navin Kumar <[email protected]> * Fixup some tests, including a typo in transpiler unicode fuzz test Signed-off-by: Navin Kumar <[email protected]> * Update fuzz tests to not include \b or \B in fuzz testing because of known issues with unicode Signed-off-by: Navin Kumar <[email protected]> * Fix issue in fuzz tests with \Z followed by $ Signed-off-by: Navin Kumar <[email protected]> * Fix issue with word boundaries and negative character classes \D,\W,\S Signed-off-by: Navin Kumar <[email protected]> * Add reference to issue regarding \b and \B unicode issue Signed-off-by: Navin Kumar <[email protected]> * Fall back to CPU when negated character class is next to word boundary Signed-off-by: Navin Kumar <[email protected]> * Add \H and \V to fallback scenario with word boundaries Signed-off-by: Navin Kumar <[email protected]> * remove this test since it was removed in the upstream branch Signed-off-by: Navin Kumar <[email protected]> * move word boundary fuzz testing logic to a separate flag skipUnicodeIssues which will skip when testing full unicode characters but will use when using a smaller ASCII subset Signed-off-by: Navin Kumar <[email protected]> * Update the jenkins scripts here to set the locale Signed-off-by: Navin Kumar <[email protected]> * need to export LC_ALL in mvn_verify stage here Signed-off-by: Navin Kumar <[email protected]> * add comment for LC_ALL Signed-off-by: Navin Kumar <[email protected]> * Regexp compatibility doc update Signed-off-by: Navin Kumar <[email protected]> * Update scalatests and premerge build script Signed-off-by: Navin Kumar <[email protected]> * update build scripts to test regexp separately from other tests because of locale requirement Signed-off-by: Navin Kumar <[email protected]> * Feedback: code cleanup Signed-off-by: Navin Kumar <[email protected]> * Fix syntax errors in RegularExpressionSuite that prevent it from loading in non-UTF-8 environments Signed-off-by: Navin Kumar <[email protected]> * register custom regexp mark Signed-off-by: Navin Kumar <[email protected]> * updates to build script and test script Signed-off-by: Navin Kumar <[email protected]> * revert the nightly build script updates Signed-off-by: Navin Kumar <[email protected]>
- Loading branch information