Word list doesn't fulfill stated criteria/requirements #10

Open
sts10 opened this issue Oct 23, 2023 · 0 comments
I noticed this "Mandatory Criteria" section in the README of this project:

mnemonicode/README

Lines 28 to 37 in 315aed6

Mandatory Criteria:
- The wordlist contains 1626 words.
- All words are between 4 and 7 letters long.
- No word in the list is a prefix of another word (e.g. visit,
visitor).
- Five letter prefixes of words are sufficient to be unique.

Unfortunately, some of these promised qualities are no longer true of the current list.

I don't know how crucial these qualities are for the various purposes the list is used for, but I think it'd be good to update the README so that it reflects the current list.

  • The list has 1633 words on it, not 1626.
  • There are a handful of 3-letter words on the list, like "ego", "fax", "jet" and "ski". (Note that removing all 3-letter words leaves you with 1,626 words -- perhaps the original list size?)
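The count and length checks above can be reproduced with a few lines of Python. This is only a minimal sketch: the function name `length_report` and the tiny `sample` list are made up for illustration, and the real word list would have to be loaded from the project's source instead.

```python
def length_report(words, min_len=4, max_len=7):
    """Return (word_count, words whose length is outside [min_len, max_len])."""
    out_of_range = [w for w in words if not (min_len <= len(w) <= max_len)]
    return len(words), out_of_range


# Hypothetical sample, NOT the real list: a few words mentioned in this issue.
sample = ["ego", "fax", "jet", "ski", "visit", "capital", "format"]

count, offenders = length_report(sample)
print(count)      # number of words in the sample
print(offenders)  # words that break the "4 to 7 letters" criterion
```

Run against the full list, `count` is the figure to compare with the README's 1626, and `offenders` would contain the 3-letter words.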

Also, a number of words on the list share their first five letters with other words on the list, so five-letter prefixes are not sufficient to identify them uniquely. Here are the ones I was able to find:

  • capital and capitan
  • content and context
  • domingo, dominic and domino
  • formal and format
  • justice and justin
  • parade and paradox
  • patrol and patron
  • plaster and plastic
  • polite and politic
  • postage and postal
  • profile and profit
  • protect and protein
  • static and station
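One way to find collisions like these is to group the words by their first five letters and keep only the groups with more than one member. The sketch below assumes nothing about the project's own tooling; `prefix_collisions` and the `sample` list are illustrative names, not part of mnemonicode.

```python
from collections import defaultdict


def prefix_collisions(words, n=5):
    """Map each n-letter prefix shared by two or more words to those words."""
    groups = defaultdict(list)
    for word in words:
        # Words shorter than n letters are grouped under the whole word.
        groups[word[:n]].append(word)
    return {prefix: ws for prefix, ws in groups.items() if len(ws) > 1}


# Hypothetical sample, NOT the real list.
sample = ["capital", "capitan", "domingo", "dominic", "domino", "visit", "ego"]

collisions = prefix_collisions(sample)
for prefix, ws in collisions.items():
    print(prefix, ws)
```

Note that a word appearing twice in the input would also show up as a collision, so the list should be deduplicated first if that is a possibility.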

Possible solutions

  1. Edit the README to match the current list: 1633 words; all words are between 3 and 7 letters long; and no word in the list is a prefix of another word (e.g. visit, visitor). These statements are all (still) true.
  2. Remove the 3-letter words from the list, making the 1st and 2nd criteria true. You'd still need to remove the statement that "Five letter prefixes of words are sufficient to be unique."
  3. Remove one word from each of the colliding pairs above (keeping all existing 3-letter words on the list), which brings the list down to 1,614 words. Then add 12 new words to get back to the originally promised 1626 words. You'd then only have to edit the second criterion to read "All words are between 3 and 7 letters long."
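For the third option, a rough sketch of the removal step might look like the following. It keeps the first word seen for each five-letter prefix and reports the rest, so the number of removed words is also the number of replacements needed to keep the list size unchanged. The helper name `keep_one_per_prefix` and the `sample` list are assumptions for illustration only; which word of each group to keep is a judgment call the maintainers would have to make.

```python
def keep_one_per_prefix(words, n=5):
    """Keep the first word for each n-letter prefix; return (kept, removed)."""
    seen = set()
    kept, removed = [], []
    for word in words:
        prefix = word[:n]
        if prefix in seen:
            removed.append(word)  # a later word sharing an already-seen prefix
        else:
            seen.add(prefix)
            kept.append(word)
    return kept, removed


# Hypothetical sample, NOT the real list.
sample = ["formal", "format", "domingo", "dominic", "domino", "ski"]

kept, removed = keep_one_per_prefix(sample)
print(kept)
print(len(removed), "replacement words needed to keep the list size unchanged")
```

A group of three words sharing a prefix, such as the `domin` group in the sample, contributes two removals rather than one.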