-
This does seem to be a bug: U+0323 has a lower canonical combining class (ccc 220) than U+030C (ccc 230), so U+0323 should occur before U+030C in the normalized output.
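To make the ccc argument easy to verify, here is a minimal sketch that reads the canonical combining classes of the two marks from Python's standard `unicodedata` module. The variable names are just for illustration, and the values reflect whatever Unicode version ships with your Python build.

```python
import unicodedata

# Canonical combining class (ccc) of each combining mark.
# Variable names are illustrative only.
ccc_dot_below = unicodedata.combining("\u0323")  # COMBINING DOT BELOW
ccc_caron = unicodedata.combining("\u030C")      # COMBINING CARON

for cp, ccc in ((0x0323, ccc_dot_below), (0x030C, ccc_caron)):
    print(f"U+{cp:04X} {unicodedata.name(chr(cp))}: ccc={ccc}")

# Canonical ordering sorts adjacent non-starters by ascending ccc,
# so the mark with the lower class must come first.
print("U+0323 sorts before U+030C:", ccc_dot_below < ccc_caron)
```

Since both marks are non-starters with different classes, the canonical ordering step is required to put the lower one first.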
-
Hello guys!
Am I dumb, or do I not understand the normalization process at all?
I've got two sequences:
U+01C4 U+0323
U+0044 U+005A U+030C U+0323
If I take a look at UnicodeData.txt, I see
and
As we can see, the compat decomposition for U+01C4 will be U+0044 U+005A U+030C. Am I right at that point?
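One way to double-check that decomposition without reading UnicodeData.txt by hand is a small sketch on top of Python's `unicodedata.decomposition`, which returns the raw decomposition field for a character. The helper `full_compat_decomposition` is a hypothetical name made up for this example. It only expands mappings recursively and does not reorder marks, so it is not a full NFKD implementation.

```python
import unicodedata

def full_compat_decomposition(ch):
    """Recursively expand the decomposition mapping of one character.

    Hypothetical helper for illustration: it expands canonical and
    compatibility mappings but performs no canonical reordering.
    """
    mapping = unicodedata.decomposition(ch)
    if not mapping:
        return ch  # no mapping: the character decomposes to itself
    parts = mapping.split()
    if parts[0].startswith("<"):
        parts = parts[1:]  # drop a compatibility tag such as <compat>
    return "".join(full_compat_decomposition(chr(int(p, 16))) for p in parts)

def code_points(s):
    """Format a string as space-separated U+XXXX code points."""
    return " ".join(f"U+{ord(c):04X}" for c in s)

print(repr(unicodedata.decomposition("\u01C4")))  # raw field for U+01C4
print(code_points(full_compat_decomposition("\u01C4")))
```

The first line shows that the mapping for U+01C4 goes through an intermediate precomposed letter, and the second shows the fully expanded sequence.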
Let's add the non-starter U+0323 to both the non-decomposed and the decomposed U+01C4. Here we are again:
U+01C4 U+0323
U+0044 U+005A U+030C U+0323
QUESTION: if I normalize these strings with an NFKD normalizer, should I receive the same result? Because the test says the opposite.
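For comparison, here is a minimal sketch that runs both sequences through Python's built-in `unicodedata.normalize`. This is a different implementation from the normalizer being tested here, so it only shows what an independent implementation produces. The variable names are illustrative.

```python
import unicodedata

def code_points(s):
    """Format a string as space-separated U+XXXX code points."""
    return " ".join(f"U+{ord(c):04X}" for c in s)

# The two input sequences from the question.
precomposed = "\u01C4\u0323"
decomposed = "\u0044\u005A\u030C\u0323"

nfkd_precomposed = unicodedata.normalize("NFKD", precomposed)
nfkd_decomposed = unicodedata.normalize("NFKD", decomposed)

print(code_points(nfkd_precomposed))
print(code_points(nfkd_decomposed))
print("same result:", nfkd_precomposed == nfkd_decomposed)
```

With Python's implementation, both inputs should come out as U+0044 U+005A U+0323 U+030C: after decomposition, canonical reordering moves U+0323 in front of U+030C in both cases, so the two results compare equal.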
Here is a test:
And here is the result: