Hacker News new | ask | show | jobs
by rrr_oh_man 479 days ago
Nice UI, indeed!

I posted the first thing on my clipboard (a YouTube URL) and, interestingly, it showed me

  Words: 1
  Sentences: 3

PS: thank you for not shoveling LLMs into this
2 comments

Interesting... I'll have a look at the "algo"

Yep no need to over engineer things here :)

i posted an emoji and it counted it as two characters
Check this out if you're interested: https://www.youtube.com/watch?v=sTzp76JXsoY (Tom Scott, 3min, "Why Do Flag Emoji Count As Two Characters?")

Or even more in depth: https://www.youtube.com/watch?v=mubfp9WYvvI (Jennifer Lee, 55min, "Emoji by the People, For the People, a CS50 tech talk")

Yes, but it raises the question of what exactly the "character" count is supposed to mean.

Does anyone need a "UTF-16 code unit count"? Somehow I don't think so. So it should really be counting graphemes, in my opinion.

I don't think anyone was arguing this point, really.
I am arguing this point, because it’s a huge flaw in the premise of the tool in question.
This is probably just someone’s weekend passion project. The issue looks more like an edge case than something the creator is super attached to.

Let’s be kind. Everyone’s just doing their best. (I know I also come off too harsh too often, especially online.)