Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dict: Correct case for entries #890

Merged
merged 2 commits into from
May 30, 2024
Merged

dict: Correct case for entries #890

merged 2 commits into from
May 30, 2024

Conversation

hegotit
Copy link
Contributor

@hegotit hegotit commented May 29, 2024

No description provided.

@mirtlecn
Copy link
Collaborator

mirtlecn commented May 30, 2024

感谢

P.S. 为啥有这么多大小写错误,这词库仓库里面的英文词库之前是咋弄的

我翻了一遍,大多是人名、地名大小写错了。都没什么问题。

词典源的是应当是按一些书面词典的,把一些英文语料库出现过的口语连写缩写添加回去了,注释掉了语料库显示的低频词。

@mirtlecn mirtlecn merged commit c5b8efc into iDvel:main May 30, 2024
@hegotit hegotit deleted the enDict branch May 30, 2024 15:16
@hegotit
Copy link
Contributor Author

hegotit commented May 31, 2024

感谢

P.S. 为啥有这么多大小写错误,这词库仓库里面的英文词库之前是咋弄的

我翻了一遍,大多是人名、地名大小写错了。都没什么问题。

词典源的是应当是按一些书面词典的,把一些英文语料库出现过的口语连写缩写添加回去了,注释掉了语料库显示的低频词。

问下你说的英文语料库是Google的吗?

@mirtlebot
Copy link

ludwig 和 Linggle

@iDvel
Copy link
Owner

iDvel commented Jun 9, 2024

P.S. 为啥有这么多大小写错误,这词库仓库里面的英文词库之前是咋弄的

之前的数据来源没有大小写,是纯小写。
我记得那是一个寂寞的夜晚,我从头到尾简单捋了一遍,都是手动查阅的,漏掉了很多,一直没有二校。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants