Y
Hacker News
new
|
ask
|
show
|
jobs
by
pcwelder
236 days ago
I ϲаn guаrаntее thаt thе ОСR ϲаn't rеаd thіs sеntеnсе ϲоrrесtlу.
3 comments
syntaxing
236 days ago
What’s correct though? Even as a human, I read that “correctly”. Using weird representations of C doesn’t change the word?
link
LudwigNagasena
236 days ago
I would even say that OCR can rеаd the sеntеnсе ϲоrrесtlу, while a tokenizer can't.
link
kgeist
236 days ago
Qwen3 8b perfectly understood it after 14 seconds of thinking.
link
metalliqaz
236 days ago
Yeah OCR would be much more likely to read that sentence the way a human would.
link
bitdivision
236 days ago
A lot of Cyrillic characters:
https://apps.timwhitlock.info/unicode/inspect?s=I+%CF%B2%D0%...
link
geysersam
236 days ago
Really? How so?
link
moduspol
236 days ago
Looks like he’s using atypical “c” characters.
link