|
|
|
|
|
by rthnbgrredf
500 days ago
|
|
I'm still not convinced that this isn't a tokenizer issue. Were you able to find a substantial number of questions that do not fall into the letter countinh or word shuffling domsin - problems that are clearly unrelated to the fundamental tokenizer issue of modern LLMs? Otherwise, I would argue that your paper simply proves that the issue still exists. |
|
Getting to 100% may require tokenization innovation, sure.