Hacker News
by wongarsu 26 days ago
> If not, is there really a point in including everything possible in Unicode?

Needing to load three fonts to show a single document that mixes vastly different character sets is still infinitely better than not being able to have those different characters in the same .txt or .md file at all.

> How much space remains for new alphabets and smileys?

Unicode can encode about 1,100k code points (1,114,112 to be exact, U+0000 through U+10FFFF), and about 800k of those are currently unassigned and available for future scripts or characters.
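If you want to check those figures yourself, here is a rough sketch in Python. It assumes the standard `unicodedata` module, which reflects whatever Unicode version your interpreter ships with, so the exact count will vary a bit between Python releases. The helper name `count_unassigned` is just something made up for this example.

```python
import sys
import unicodedata

# sys.maxunicode is 0x10FFFF on Python 3.3+, so this is 1,114,112.
TOTAL_CODE_POINTS = sys.maxunicode + 1


def count_unassigned():
    """Count code points whose general category is Cn.

    Cn covers reserved (unassigned) code points plus the 66
    noncharacters, so it slightly overstates the free space.
    """
    return sum(
        1
        for cp in range(TOTAL_CODE_POINTS)
        if unicodedata.category(chr(cp)) == "Cn"
    )


unassigned = count_unassigned()
print(
    f"Unicode {unicodedata.unidata_version}: "
    f"{unassigned:,} of {TOTAL_CODE_POINTS:,} code points are unassigned"
)
```

Note that private-use code points (category Co) and surrogates (Cs) are not counted as unassigned here, which is why the free space is smaller than the total minus the number of assigned characters.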


Also, the 1.1M limit exists because of UTF-16: its surrogate pairs cannot address anything above U+10FFFF. If UTF-16 were deprecated in favor of UTF-8, the limit could be much higher.
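To make the arithmetic concrete, here is a small Python sketch of where the 1.1M figure comes from, and how far a UTF-8-style scheme could go. The second half assumes the original UTF-8 design with sequences of up to 6 bytes (since restricted to 4 bytes by RFC 3629 to match UTF-16). `utf8_payload_bits` is a hypothetical helper written for this comment, not a library function.

```python
# UTF-16: the BMP plus every (high surrogate, low surrogate) combination.
BMP_SIZE = 0x10000
HIGH_SURROGATES = 0xDBFF - 0xD800 + 1  # 1,024 lead values
LOW_SURROGATES = 0xDFFF - 0xDC00 + 1   # 1,024 trail values

utf16_limit = BMP_SIZE + HIGH_SURROGATES * LOW_SURROGATES
print(f"UTF-16 can address {utf16_limit:,} code points")


def utf8_payload_bits(n_bytes):
    """Payload bits in an n-byte sequence of the original 1-6 byte UTF-8 design."""
    if n_bytes == 1:
        return 7  # 0xxxxxxx
    # The lead byte carries (7 - n) bits; each continuation byte carries 6.
    return (7 - n_bytes) + 6 * (n_bytes - 1)


for n in range(1, 7):
    bits = utf8_payload_bits(n)
    print(f"{n}-byte UTF-8 sequence: {bits} bits, up to {2**bits:,} values")
```

So the ceiling is 0x10000 + 1024 × 1024 = 1,114,112 under UTF-16, whereas a 6-byte UTF-8 sequence has room for 31 bits, about 2.1 billion values.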
We need UTF-32. For the future.
UTF-32 already exists, but nobody uses it because it’s much less efficient for most textual data than UTF-8.
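A quick way to see the size difference, sketched in Python with the built-in codecs. The sample strings are arbitrary picks for illustration, and `utf-32-le` is used so that no byte-order mark inflates the count.

```python
samples = {
    "ascii": "Hello, world",
    "japanese": "日本語",
}

sizes = {}
for name, text in samples.items():
    utf8_len = len(text.encode("utf-8"))
    utf32_len = len(text.encode("utf-32-le"))  # always 4 bytes per code point
    sizes[name] = (utf8_len, utf32_len)
    print(f"{name}: UTF-8 = {utf8_len} bytes, UTF-32 = {utf32_len} bytes")
```

For ASCII-heavy text UTF-32 is four times larger; for CJK text the gap narrows, since those characters take 3 bytes each in UTF-8.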
UTFv6