by refulgentis
757 days ago
> This is basically free to add, and there's no reason it shouldn't be made part of standard tokenization.

This is muchhhhh different from how tokenization works today. Adding tokens to the vocabulary is free; everything outside that (i.e. string -> tokens) is going to be a major pain in the ass. Doable, but annoying and error-prone.