Hacker News
by TomSwirly 1839 days ago
1. Unicode has no notion of "bytes" at all. It defines code points; bytes only appear once you pick an encoding.

2. Only in the UTF-32 encoding, which almost no one uses on the web, are all characters represented by 4 bytes.

Most sites today use UTF-8, where most characters on a standard keyboard are 1 byte, and almost all characters from any language take at most 3 bytes.

3. Even 19 characters is a lot better than 12.
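
If you want to check points 1 and 2 yourself, here is a rough Python sketch. The helper name `encoded_sizes` and the sample characters are my own picks for illustration, not anything from the thread; it just encodes each character with the standard `str.encode` and counts the bytes. I use `utf-32-le` so the byte-order mark doesn't inflate the count.

```python
def encoded_sizes(text):
    """Return (char, utf8_len, utf32_len) for each character in text.

    Hypothetical helper for illustration only.
    """
    return [
        (ch, len(ch.encode("utf-8")), len(ch.encode("utf-32-le")))
        for ch in text
    ]


# Sample characters (my choice): ASCII letter, Latin-1 letter,
# euro sign, and an emoji outside the Basic Multilingual Plane.
samples = "a\u00e9\u20ac\U0001F600"

for ch, utf8_len, utf32_len in encoded_sizes(samples):
    print(f"U+{ord(ch):04X}: UTF-8 = {utf8_len} byte(s), UTF-32 = {utf32_len} bytes")
```

The same code point comes out as 1, 2, 3, or 4 bytes in UTF-8 but always 4 in UTF-32, so a "character" only has a byte size relative to an encoding.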