| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by wongarsu 26 days ago

> if not is there really a point to include everything possible in unicode ?

Needing to load three fonts to show a single document that mixes vastly different character sets is still infinitely better than not being able to have those different characters in the same .txt or .md file at all

> how many space is remaining for new alphabet and smileys ?

Unicode can encode about 1100k code points, and about 800k of those are currently unassigned and available for future scripts or characters

1 comments

xigoi 26 days ago

Also, the 1.1M limit is because of UTF-16. If UTF-16 was deprecated in favor of UTF-8, the limit could be much higher.

link

hulitu 25 days ago

We need UTF-32. For the future.

link

xigoi 25 days ago

UTF-32 already exists, but nobody uses it because it’s much less efficient for most textual data than UTF-8.

link

pvdebbe 25 days ago

UTFv6

link