Hacker News new | ask | show | jobs
by LInuxFedora 5716 days ago
C'mon they are distinguishable. can you differentiate between λ and ø .
2 comments

But can you quickly and easily distinguish between Ͱ and ͱ, or even Ω and Ω? The last two really are different Unicode characters, the first is Greek capital Omega, the second is the Ohm symbol...
And the first two are rendered in my browser as a square box because the font doesn't support them. Useless.
Mobile webkit?
Firefox on Vista renders as boxes.
I see why that's a problem for domain names, but not for programming languages. You just define that one character has meaning and all its homoglyphs are syntax errors (even in identifier names).
But what characters are homoglyphs are determined by your choice of typeface.
Introduce a library or something written by someone else that tries to use clever and cute Unicode names, and imagine debugging that.
Which is kind of weird, considering that the Ohm symbol is by definition the greek Omega - why did we use two separate unicode slots for them?
I suspect they incorporated two different pre-existing symbol sets - the Greek alphabet and a set of scientific symbols.

Added - they did; here is an older version (3.0) of the standard online http://www.unicode.org/book/u2.html

Do you have any idea why there are 2 different Unicode characters when on paper you would use the Greek Omega letter?
Unicode is a superset of numerous other encodings, usually the national standards of the countries whose written languages they want to encode. For example, the sections for the Greek and Cyrillic alphabets have identical layouts to existing Greek-only and Russian-only encodings.

The exceptions to this policy are usually writing systems that didn't have standardized encodings at the time they were incorporated into Unicode. Also, the rather notorious CJK unification project resulted in several new overlapping layouts as Unicode policy changed over time with respect to Chinese characters.

That's another very interesting difficulty: the symbol for ohm is supposed to be the Greek capital omega. Resolving such ambiguities (where applicable, of course) would probably add a lot of complicated code, and I'm not 100% sure it's trivial for all similar cases.
But can you distinguish between a capital Beta and a Shluse?
Just for reference in your screen font: β and ß