by bsder 416 days ago
TIL: In the worst case, "20 UTF-8 bytes" == "1 Hindi character"

Going to have to remember that.

1 comment

You can go well beyond that, although past some point I think the resulting character is unlikely to be semantically valid.
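To make "way beyond" concrete, here is a rough sketch in Python. It assumes the usual Unicode segmentation rule (UAX #29: no grapheme cluster break before an Extend character), under which a base letter followed by any number of nonspacing marks counts as one user-perceived character. Python's standard library does not segment grapheme clusters, so the code only counts code points and bytes; the helper names `stacked` and `utf8_len` are made up for this illustration.

```python
import unicodedata

BASE = "\u0915"   # DEVANAGARI LETTER KA
NUKTA = "\u093C"  # DEVANAGARI SIGN NUKTA, a nonspacing combining mark


def stacked(n_marks: int) -> str:
    """Return BASE followed by n_marks copies of NUKTA (hypothetical helper)."""
    return BASE + NUKTA * n_marks


def utf8_len(s: str) -> int:
    """Number of bytes in the UTF-8 encoding of s."""
    return len(s.encode("utf-8"))


if __name__ == "__main__":
    # General category "Mn" = nonspacing mark, which attaches to the base.
    print(unicodedata.name(NUKTA), unicodedata.category(NUKTA))
    for n in (0, 6, 100):
        s = stacked(n)
        print(f"{n} marks: {len(s)} code points, {utf8_len(s)} UTF-8 bytes")
```

Both code points sit in the U+0800 to U+FFFF range, so each takes 3 bytes in UTF-8: 6 marks already gives 21 bytes, and 100 marks gives 303. Nothing in the encoding stops you from going further, but a KA with a hundred nuktas is not meaningful Hindi, which is the "semantically valid" caveat.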