-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Find out exact ordering of hiragana/katakana letters in native apple platforms #86636
Comments
Tagging subscribers to 'os-ios': @steveisok, @akoeplinger Issue DetailsWhile working on #85965 we find out that ordering of hiragana/katakana letters in native apple platforms is not so clear.
Find out what is the exact flow of ordering and update Contributes to #80689
|
There's no such thing as "case" (in the english/latin sense; I guess they might still be designated that way in unicode, but I somehow doubt it) for kana. Small characters are normally used to modify the sounds of "normal sized" characters. For instance:
You're not supposed to have small characters on their own - formally that doesn't make any sense.
I'm not sure why you're stating the ordering of hiragana/katakana flipped here, unless it's something specific to the original test data? Except looking at the unicode blocks there's a complete match, so it should be possible to do this via offset from start of block (Assuming an interleaved/phonetic ordering, rather than just as two separate blocks). Note that the two characters chosen as an example have the
The unicode block actually shows an equivalent entry for a hiragana character, I don't actually recognize everything in the blocks - they aren't in the commonly taught syllabary (at minimum when learning Japanese as a foreign language, but I don't think in Japanese schools either). I'm not sure of some of the uses of some of the characters. Also, as an additional wrinkle, for historical reasons there's a half-width katakana block. Note this block does not include the pre-combined versions of characters, and has an additional set of (half-width) combining characters. |
Thanks for the explanation. My bad for using lowercase/uppercase words, changed to small letter. I wanted to give examples where It is known that on |
@mkhamoyan @steveisok could you please triage this issue? Thanks! |
Further investigation reveals that the behavioral changes observed are a result of our approach to achieve results same to ICU in hybrid mode. |
While working on #85965 we find out that ordering of hiragana/katakana letters in native apple platforms is not so clear.
There are 3 cases
Letters that have small equivalent
For this case ordering works like
hiragana small letter < katakana small letter < hiragana letter < katakana letter
Letters without small equivalent
For this case ordering is
katakana letter < hiragana letter
but not sure it comes after small katakana letters or somewhere else.Letters only existing in katakana
Not sure where these letters are in ordering.
Find out what is the exact flow of ordering and update
hybrid-globalization.md
forOSX
platformCompareString
function details and add more test cases showing the ordering.Contributes to #80689
The text was updated successfully, but these errors were encountered: