The whitelist does not work when words have mixed case #1860

xsmq · 2021-01-21T02:09:20Z

1. Without the whitelist

$ codespell -q 7 mindspore/mindspore/lite/tools/converter/parser/tf
mindspore/mindspore/lite/tools/converter/parser/tf/tf_merge_parser.cc:39: MergeT ==> merge
mindspore/mindspore/lite/tools/converter/parser/tf/tf_activation_parser.cc:66: shoud ==> should

2. Add whitelist

(1) MergeT

$ cat codespell.txt
MergeT
shoud

$ codespell -q 7 -I codespell.txt mindspore/mindspore/lite/tools/converter/parser/tf
mindspore/mindspore/lite/tools/converter/parser/tf/tf_merge_parser.cc:39: MergeT ==> merge

(2) nNumber

$ cat codespell.allow
MergeT
nNumber

$ codespell -q 7 -I codespell.allow mindspore/mindspore/ccsrc/minddata/dataset/engine/connector.h
mindspore/mindspore/ccsrc/minddata/dataset/engine/connector.h:151: nNumber ==> number
mindspore/mindspore/ccsrc/minddata/dataset/engine/connector.h:152: nNumber ==> number

(3) REALEASE

$ cat codespell.allow
pyhton
REALEASE

$ codespell -q 7 -I codespell.allow mindspore/mindspore/lite/examples/train_lenet/README.md
mindspore/mindspore/lite/examples/train_lenet/README.md:67: REALEASE ==> RELEASE
mindspore/mindspore/lite/examples/train_lenet/README.md:68: betweeen ==> between
mindspore/mindspore/lite/examples/train_lenet/README.md:72: followings ==> following
mindspore/mindspore/lite/examples/train_lenet/README.md:78: paramaters ==> parameters

r3econ · 2021-04-26T12:39:27Z

I'm experiencing the same problem. It makes using this tool in production impossible.

peternewman · 2021-04-26T13:26:13Z

Hi @xsmq,

As mentioned to @r3econ in codespell-project/actions-codespell#29, from the main codespell help (https://github.com/codespell-project/codespell#readme):

Important note: The list passed to -I is case-sensitive based on how it is listed in the codespell dictionaries.
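To make that note concrete, here is a minimal sketch of the behaviour it describes. It is a simplified model, not codespell's actual code: the function name `suggested_fix` and the one-entry `dictionary` are hypothetical, and the only assumption taken from this thread is that the ignore list is compared against the dictionary's spelling of the word rather than the spelling found in the input file.

```python
def suggested_fix(word, dictionary, ignore_list):
    """Return the suggested fix for `word`, or None if it is not reported.

    Simplified model: the word is looked up case-insensitively, but the
    ignore list is compared against the dictionary key, not the input word.
    """
    key = word.lower()
    if key not in dictionary:
        return None  # not a known misspelling
    if key in ignore_list:
        return None  # suppressed: ignore entry matches the dictionary key
    return dictionary[key]


# Hypothetical one-entry dictionary, mirroring "merget->merge".
dictionary = {"merget": "merge"}

# Ignore entry written as it appears in the source file: still reported.
print(suggested_fix("MergeT", dictionary, {"MergeT"}))

# Ignore entry written as it appears in the dictionary: suppressed.
print(suggested_fix("MergeT", dictionary, {"merget"}))
```

Under this model the first call still returns a fix and the second returns None, which matches the output reported at the top of this issue.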

2. Add whitelist

(1) MergeT

$ grep -iIR merget codespell_lib/data/
codespell_lib/data/dictionary.txt:merget->merge

(2) nNumber

$ cat codespell.allow
MergeT
nNumber

$ grep -iIR nnumber codespell_lib/data/
codespell_lib/data/dictionary.txt:nnumber->number

(3) REALEASE

$ cat codespell.allow
pyhton
REALEASE

$ grep -iIR ^realease codespell_lib/data/
codespell_lib/data/dictionary.txt:realease->release
codespell_lib/data/dictionary.txt:realeased->released
codespell_lib/data/dictionary.txt:realeases->releases

So you want these in your codespell.allow:

merget
nnumber
realease
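The manual grep lookups can be automated. The following is a rough sketch, assuming dictionary lines have the `typo->fix` form shown in the grep output; the helper names `dictionary_keys` and `normalize_allow_list` and the three-line `sample_dictionary` are hypothetical and only for illustration. With a real checkout you would read the lines from codespell_lib/data/dictionary.txt instead.

```python
def dictionary_keys(lines):
    """Map lowercase typo -> typo exactly as listed in the dictionary."""
    keys = {}
    for line in lines:
        if "->" not in line:
            continue  # skip anything that is not a 'typo->fix' entry
        typo = line.split("->", 1)[0].strip()
        keys.setdefault(typo.lower(), typo)
    return keys


def normalize_allow_list(words, keys):
    """Rewrite each word to the dictionary's spelling; keep unknown words."""
    return [keys.get(word.lower(), word) for word in words]


# Hypothetical sample; the entries are copied from the grep output above.
sample_dictionary = [
    "merget->merge",
    "nnumber->number",
    "realease->release",
]

keys = dictionary_keys(sample_dictionary)
for entry in normalize_allow_list(["MergeT", "nNumber", "REALEASE"], keys):
    print(entry)
```

For the three words in this issue the sketch prints merget, nnumber and realease, one per line, which is the list suggested above.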

xsmq · 2021-06-28T01:31:58Z

OK, thanks.

peterjc · 2021-08-29T22:49:43Z

I just struggled with this with an all capital term, having initially expected this to mean case sensitive to match the input file. That does seem a more intuitive behaviour - although a change to the tool.

peternewman · 2021-11-30T13:35:10Z

> I just struggled with this with an all capital term, having initially expected this to mean case sensitive to match the input file.

@peterjc we'd welcome suggestions for how to make the existing help file more clear/less confusing:

Important note: The list passed to -I is case-sensitive based on how it is listed in the codespell dictionaries.

> That does seem a more intuitive behaviour - although a change to the tool.

If it had to match the input file, then if I had this input:

SPELING TOOLS
There are lots of tools to catch spelings available

Then I'd need to add two ignore entries for the two cases that appear in the input.

I'm not entirely sure of the history of why we don't just do a case-insensitive comparison, but it would prevent some classes of typos being entered and therefore detected (e.g. names being in lower case).
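The trade-off can be sketched by counting the ignore entries each policy would need. This is an illustration only: `entries_needed` and the policy names are hypothetical, the sample assumes the same misspelling occurs in two cases, and it assumes the dictionary lists the word in lower case, as the entries shown earlier in this thread are.

```python
def entries_needed(occurrences, policy):
    """Return the ignore entries needed to silence every occurrence."""
    if policy == "match-input-case":
        # One entry per distinct spelling that appears in the input.
        return sorted(set(occurrences))
    if policy == "match-dictionary-case":
        # One entry per dictionary key (assumed lowercase here).
        return sorted({word.lower() for word in occurrences})
    raise ValueError(f"unknown policy: {policy}")


# Hypothetical input: the same misspelling in a heading and in body text.
occurrences = ["SPELING", "speling"]

print(entries_needed(occurrences, "match-input-case"))
print(entries_needed(occurrences, "match-dictionary-case"))
```

Matching the input case needs one entry per casing that appears, while matching the dictionary case needs a single entry regardless of how the word is capitalised in the files.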

peternewman added the question label Apr 26, 2021

xsmq closed this as completed Nov 30, 2021
