use unicode_script crate to check chars are Latin script inside is_english_lingual
#504
The build errors seem to have something to do with "manifest keys"?
When you add a dependency, … Also, would you mind making sure your commits match the practices laid out in our documentation?
Hey @hippietrail, I feel pretty confident merging this. Could you give me a couple of examples that replicate the issue so I can add them as test cases?
Co-authored-by: hippietrail <[email protected]>
I should note two things:
This MR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [Automattic/harper/harper-ls](https://github.com/Automattic/harper) | patch | `v0.18.0` -> `v0.18.1` |

MR created with the help of [el-capitano/tools/renovate-bot](https://gitlab.com/el-capitano/tools/renovate-bot). **Proposed changes to behavior should be submitted there as MRs.**

---

### Release Notes

<details>
<summary>Automattic/harper (Automattic/harper/harper-ls)</summary>

### [`v0.18.1`](https://github.com/Automattic/harper/releases/tag/v0.18.1)

[Compare Source](Automattic/harper@v0.18.0...v0.18.1)

#### What's Changed

- build(deps): bump [@sveltepress/theme-default](https://github.com/sveltepress/theme-default) from 5.0.5 to 5.0.7 in /packages by [@dependabot](https://github.com/dependabot) in Automattic/harper#519
- build(deps-dev): bump eslint-config-prettier from 9.1.0 to 10.0.1 in /packages by [@dependabot](https://github.com/dependabot) in Automattic/harper#518
- build(deps-dev): bump vite-plugin-top-level-await from 1.4.1 to 1.4.4 in /packages by [@dependabot](https://github.com/dependabot) in Automattic/harper#516
- build(deps-dev): bump esbuild from 0.20.2 to 0.24.2 in /packages by [@dependabot](https://github.com/dependabot) in Automattic/harper#517
- build(deps-dev): bump flowbite from 1.8.1 to 3.0.0 in /packages by [@dependabot](https://github.com/dependabot) in Automattic/harper#515
- technical terms and popular names/websites by [@MohamedAbdeen21](https://github.com/MohamedAbdeen21) in Automattic/harper#522
- fix(core): `AnA` linter did not recognize capital articles by [@elijah-potter](https://github.com/elijah-potter) in Automattic/harper#521
- use unicode_script crate to check chars are Latin script inside `is_english_lingual` by [@hippietrail](https://github.com/hippietrail) in Automattic/harper#504
- sort and add to list of holidays by [@hippietrail](https://github.com/hippietrail) in Automattic/harper#509
- fix(core): `RepeatedWords` now detects repeated `and` tokens by [@elijah-potter](https://github.com/elijah-potter) in Automattic/harper#520
- `harper-core` Documentation Updates by [@elijah-potter](https://github.com/elijah-potter) in Automattic/harper#513
- feat: improve workflow for harper.js by [@Asuka109](https://github.com/Asuka109) in Automattic/harper#526
- Add more languages by [@elijah-potter](https://github.com/elijah-potter) in Automattic/harper#495

#### New Contributors

- [@MohamedAbdeen21](https://github.com/MohamedAbdeen21) made their first contribution in Automattic/harper#522
- [@Asuka109](https://github.com/Asuka109) made their first contribution in Automattic/harper#526

**Full Changelog**: Automattic/harper@v0.18.0...v0.18.1

</details>

---

### Configuration

📅 **Schedule**: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 **Automerge**: Disabled by config. Please merge this manually once you are satisfied.

♻ **Rebasing**: Whenever MR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 **Ignore**: Close this MR and you won't be reminded about this update again.

---

- [ ] If you want to rebase/retry this MR, check this box

---

This MR has been generated by [Renovate Bot](https://github.com/renovatebot/renovate).
I noticed `is_english_lingual()` doesn't check that words are in the Latin script, so it would match any 'alphabet' except Chinese, Japanese, and Korean. Unicode character properties are finer-grained than Unicode blocks, and if new extension blocks are added you don't have to modify the code. I left the other logic in there, but some of it may now be redundant.

Note that typst, which I don't know much about but which seems to have some link to Harper, also uses the `unicode_script` crate.
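
To make the problem concrete, here is a rough std-only sketch of the two approaches the description contrasts. It is not Harper's code: `is_alphabetic_non_cjk`, `is_cjk`, and `is_latin_by_block` are hypothetical names, and the ranges are a deliberately incomplete hand-written table. With the `unicode_script` crate the Latin check would presumably reduce to something like `c.script() == Script::Latin`, with no table to maintain.

```rust
// Hypothetical stand-in for the behaviour described above: accept any
// alphabetic character that is not Chinese, Japanese, or Korean.
// This is an illustration, not Harper's actual implementation.
fn is_cjk(c: char) -> bool {
    matches!(
        c as u32,
        0x3040..=0x30FF       // Hiragana and Katakana blocks
            | 0x4E00..=0x9FFF // CJK Unified Ideographs block
            | 0xAC00..=0xD7AF // Hangul Syllables block
    )
}

fn is_alphabetic_non_cjk(c: char) -> bool {
    c.is_alphabetic() && !is_cjk(c)
}

// Block-based approximation of "is this character Latin script?".
// The table is hand-maintained and intentionally incomplete: it omits
// later additions such as the Latin Extended-C block (U+2C60..U+2C7F).
fn is_latin_by_block(c: char) -> bool {
    matches!(
        c as u32,
        0x0041..=0x005A | 0x0061..=0x007A                         // Basic Latin letters
            | 0x00C0..=0x00D6 | 0x00D8..=0x00F6 | 0x00F8..=0x00FF // Latin-1 letters, skipping × and ÷
            | 0x0100..=0x024F                                     // Latin Extended-A and -B
            | 0x1E00..=0x1EFF                                     // Latin Extended Additional
    )
}
```

Under these assumptions, `is_alphabetic_non_cjk('д')` is true, so Cyrillic text slips through a check that only excludes CJK, while `is_latin_by_block('\u{2C65}')` is false even though U+2C65 is a Latin letter. A lookup based on the Unicode Script property avoids both failure modes.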