Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problems detecting words in hiragana #2105

Open
Wend1go opened this issue Nov 19, 2024 · 3 comments
Open

Problems detecting words in hiragana #2105

Wend1go opened this issue Nov 19, 2024 · 3 comments

Comments

@Wend1go
Copy link

Wend1go commented Nov 19, 2024

I noticed that 10ten has some issues with Hiragana like in this sentence from an NHK headline:
アメリカ トランプさんがまた大統領になることになった
("nina" - "naruko" instead of "ninaru")
Maybe it would be possible to extend or shrink the selection using the mouse wheel by one letter per turn while holding shift or another key.

Maybe related to #1969

@birtles
Copy link
Member

birtles commented Nov 19, 2024

Hi! Thanks for the bug report!

Is the issue that 10ten doesn't recognize になる? If so, I think that's just because the source dictionary (JMdict) doesn't have an entry for になる. It only contains the verb なる without the に particle.

When looking up from な onwards it will show なるこ before なる(成る) because it's longer and it doesn't do any word segmentation, but just looks up from where the cursor is.

Does that make sense or have I misunderstood?

@Wend1go
Copy link
Author

Wend1go commented Nov 19, 2024

Yes, this makes sense.
So in order to get 10ten to notice "ninaru", the underlying dictionary needs to be updated.

@birtles
Copy link
Member

birtles commented Nov 20, 2024

Yes, that's right. However, I don't think JMdict (the source) is in the habit of listing combinations of particles and verbs (since the particle will depend on the preceding context).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants