Hacker News new | ask | show | jobs
by LoganDark 1071 days ago
There are only 256 bytes. CJK characters can be produced by outputting these bytes in a certain order. LLMs are capable of outputting multiple tokens in order because even many words are multiple tokens each.