by olssy 4499 days ago
I think it's actually CESU-8 encoding: http://www.unicode.org/reports/tr26/

I'm guessing it was implemented this way for performance reasons, and calling it UTF-8 was just a marketing ploy, since everyone knows we should always use UTF-8 for everything...
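
To make the difference concrete, here is a rough Python sketch of how CESU-8 differs from real UTF-8 as I read TR26. The helper name cesu8_encode is my own invention for illustration, not part of any library, and this only shows the encoding side. Code points above U+FFFF are first split into a UTF-16 surrogate pair, and each surrogate is then written as its own 3-byte sequence, so you get 6 bytes where UTF-8 would use 4.

```python
def cesu8_encode(text: str) -> bytes:
    """Encode text as CESU-8 (hypothetical helper, sketch only)."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp >= 0x10000:
            # Supplementary code point: split into a UTF-16 surrogate pair.
            cp -= 0x10000
            high = 0xD800 + (cp >> 10)
            low = 0xDC00 + (cp & 0x3FF)
            # Write each surrogate as a separate 3-byte sequence.
            # "surrogatepass" lets Python's UTF-8 codec emit lone surrogates.
            for surrogate in (high, low):
                out += chr(surrogate).encode("utf-8", "surrogatepass")
        else:
            # BMP code points are encoded exactly as in UTF-8.
            out += ch.encode("utf-8", "surrogatepass")
    return bytes(out)


emoji = "\U0001F600"
print(emoji.encode("utf-8").hex())  # f09f9880 (4 bytes)
print(cesu8_encode(emoji).hex())    # eda0bdedb880 (6 bytes)
```

Anything inside the BMP comes out byte-for-byte identical in both encodings, which is why the difference tends to go unnoticed until someone stores a character outside it.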