by cmccabe
4948 days ago
Code points aren't letters. Consider the following sequence of code points:

U+0041 U+0308 [edit: corrected sequence]

That equals this European letter: Ä

Two code points, one letter. MAGIC! You can also get the same-looking letter with a single code point, U+00C4 (Unicode likes redundancy).

Not all languages have letters. Not all languages that have letters represent each one with a single code point. Please think twice before calling people "morons."
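To make the two spellings concrete, here is a minimal Python sketch using the standard `unicodedata` module. The variable names are only for illustration; the point is that the two strings compare unequal as raw code point sequences yet normalize to each other.

```python
import unicodedata

# Two ways to spell the same visible letter (names are illustrative only).
decomposed = "\u0041\u0308"   # LATIN CAPITAL LETTER A + COMBINING DIAERESIS
precomposed = "\u00C4"        # LATIN CAPITAL LETTER A WITH DIAERESIS

# len() on a Python str counts code points, not letters.
print(len(decomposed), len(precomposed))

# Raw comparison sees two different code point sequences.
print(decomposed == precomposed)

# Normalization maps between the two spellings:
# NFC composes where it can, NFD decomposes.
nfc = unicodedata.normalize("NFC", decomposed)
nfd = unicodedata.normalize("NFD", precomposed)
print(nfc == precomposed, nfd == decomposed)
```

So a code point count of 2 versus 1 here says nothing about how many letters a reader sees; it only says which spelling was stored.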
|
Yes, I understand there are a million ways to display the same shape using various Unicode code points. But how does that make counting code points impossible?
And if you explicitly use COMBINING DIAERESIS instead of the single U+00C4, is counting the diaeresis separately somehow wrong?
Why don't we make a law stating that both ae and æ are a single letter?
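For what it's worth, counting code points is always possible; the open question is what the number means. A rough Python sketch of the two counts being argued about follows. Both function names are hypothetical, and skipping combining marks is only an approximation of "letters as a reader sees them", not a full grapheme cluster algorithm.

```python
import unicodedata

def count_code_points(text: str) -> int:
    """Count code points; a Python str is a sequence of code points."""
    return len(text)

def count_base_characters(text: str) -> int:
    """Approximate a 'letter' count by skipping combining marks.

    unicodedata.combining() returns the canonical combining class,
    which is 0 for base characters and non-zero for marks such as
    U+0308 COMBINING DIAERESIS. This is a simplification, not real
    grapheme cluster segmentation.
    """
    return sum(1 for ch in text if unicodedata.combining(ch) == 0)

for sample in ("A\u0308", "\u00C4", "ae", "\u00E6"):
    print(repr(sample), count_code_points(sample), count_base_characters(sample))
```

Under this sketch both spellings of Ä count as one base character, while "ae" counts as two and "æ" as one. Whether those last two are the same letter is a question about a language's spelling rules, and no count of code points can answer it.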