Hacker News new | ask | show | jobs
by jonhohle 1210 days ago
It will be a fun day when Unicode crosses the 5-byte UTF-8 encoding threshold :/
2 comments

It won't. We settled on using stateful combining characters instead. (Remember when the selling point of switching the world to Unicode was "represent all writing systems with a single stateless 16 bit encoding"? Yeah, well, lol.)
Anything beyond four bytes is composed of multiple code points, happily