Hacker News new | ask | show | jobs
by rrr_oh_man 476 days ago
Check this out if you're interested: https://www.youtube.com/watch?v=sTzp76JXsoY (Tom Scott, 3min, "Why Do Flag Emoji Count As Two Characters?")

Or even more in depth: https://www.youtube.com/watch?v=mubfp9WYvvI (Jennifer Lee, 55min, "Emoji by the People, For the People, a CS50 tech talk")

1 comments

Yes, but it raises the question of what exactly the "character" count is supposed to mean.

Does anyone need a "UTF-16 code unit count"? Somehow I don't think so. So it should really be counting graphemes, in my opinion.

I don't think anyone was arguing this point, really.
I am arguing this point, because it’s a huge flaw in the premise of the tool in question.
This is probably just someone’s weekend passion project. The issue looks more like an edge case than something the creator is super attached to.

Let’s be kind. Everyone’s just doing their best. (I know I also come off too harsh too often, especially online.)

This !

But I implemented Twitter and SMS accurate count thanks to all the comment, win/win!

Agreed! FWIW I was doing more free QA than nit-picking ;)