Y
Hacker News
new
|
ask
|
show
|
jobs
by
strangescript
340 days ago
A byte tokenized model is naturally 100% multi-lingual in all languages in its data set. There just isn't a lot of reason for teams to spend the extra training time to build that sort of model.