-
Notifications
You must be signed in to change notification settings - Fork 292
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Coke/clean spell #2662
Coke/clean spell #2662
Conversation
Good idea. A couple of things:
Thoughts? |
In principle, I think this is a great idea. But it means we're adding a new utility to the perl6/doc repository, which is going to be downloaded every single time someone installs the module, and this is something (along with pretty much the rest of this directory) that is interesting mostly for us, as in people who are actively working on the repo. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
With the caveat in the other comment.
Regarding sizes, the new script file is dwarfed by the list of words we're already tracking; Making a slim download of just the docs as a release seems like a solution to this problem. Regarding failures - I apparently had a local FYI, "searchable" is removed because it's in the dictionary and doesn't need to be in the list. You can see it's still present in |
It's not so much about sizes, it's about having something else in the repo, with commit history, maintenance commitments, and so on. Having all the things related to spelling check, in which only maintainers are interested, is a bit too much for the regular user who just wants to download the documentation. For the time being I think it's OK to accept the PR, but all the spelling-related things are a serious candidate for spinning off the repository, at least IMHO. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for clarifying @coke.
Nit: noted two lines with trailing whitespace.
util/clean-spell
Outdated
|
||
if +$promise.status { | ||
say "Can't find $dict/$word anywhere, kill it."; | ||
$keep = False; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Trailing whitespace.
util/clean-spell
Outdated
$keep = False; | ||
} else { | ||
say "aspell test failed, keep it"; | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Trailing whitespace.
After recent fixes to the spell checker, we have a lot of "words" that aren't needed. Also finding some that just aren't used anymore. Also finding some that are included in the dictionary now.
Maybe: * Now in dictionary * Now unused * No longer mistakenly marked as incorrect by xt/aspell.t
What's the ticket for the issue of user downloads of docs? Would like to add some notes there. |
whitespace fixed |
The two remaining issues I see based on comments Might have to add words back in - this is an ongoing battle regardless; updates to the dictionary files are made quite often. We won't have to go through this exercise of cleaning very often. If we want to run it again with just code.pws, that's a simple modification to the script, we can do that. Regarding the thread about utilities moving outside the repo, that can be handled in the context of #2542. Merging, thanks. |
The problem
Words don't get cleared out of the dictionary files, which may be needed in a few cases, like
Solution provided
Script to test if it's OK to remove each word. Does a little more work than necessary.
Update dictionary files to remove all extras (214).
PR so someone with a different aspell installation can verify there's nothing weird about my local english dictionary.