| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by LInuxFedora 5716 days ago
	C'mon they are distinguishable. can you differentiate between λ and ø .

2 comments

nuclear_eclipse 5716 days ago

But can you quickly and easily distinguish between Ͱ and ͱ, or even Ω and Ω? The last two really are different Unicode characters, the first is Greek capital Omega, the second is the Ohm symbol...

link

WalterBright 5716 days ago

And the first two are rendered in my browser as a square box because the font doesn't support them. Useless.

Mobile webkit?

Firefox on Vista renders as boxes.

link

wmf 5716 days ago

I see why that's a problem for domain names, but not for programming languages. You just define that one character has meaning and all its homoglyphs are syntax errors (even in identifier names).

link

akgerber 5716 days ago

But what characters are homoglyphs are determined by your choice of typeface.

link

jacobbijani 5716 days ago

Introduce a library or something written by someone else that tries to use clever and cute Unicode names, and imagine debugging that.

link

dedward 5716 days ago

Which is kind of weird, considering that the Ohm symbol is by definition the greek Omega - why did we use two separate unicode slots for them?

link

billswift 5715 days ago

I suspect they incorporated two different pre-existing symbol sets - the Greek alphabet and a set of scientific symbols.

Added - they did; here is an older version (3.0) of the standard online http://www.unicode.org/book/u2.html

link

ciupicri 5716 days ago

Do you have any idea why there are 2 different Unicode characters when on paper you would use the Greek Omega letter?

link

Hemospectrum 5716 days ago

Unicode is a superset of numerous other encodings, usually the national standards of the countries whose written languages they want to encode. For example, the sections for the Greek and Cyrillic alphabets have identical layouts to existing Greek-only and Russian-only encodings.

The exceptions to this policy are usually writing systems that didn't have standardized encodings at the time they were incorporated into Unicode. Also, the rather notorious CJK unification project resulted in several new overlapping layouts as Unicode policy changed over time with respect to Chinese characters.

link

mfukar 5716 days ago

That's another very interesting difficulty: the symbol for ohm is supposed to be the Greek capital omega. Resolving such ambiguities (where applicable, of course) would probably add a lot of complicated code, and I'm not 100% sure it's trivial for all similar cases.

link

kiujhgbnj 5716 days ago

But can you distinguish between a capital Beta and a Shluse?

link

llimllib 5716 days ago

Just for reference in your screen font: β and ß

link