Hacker News new | ask | show | jobs
by strangescript 340 days ago
A byte tokenized model is naturally 100% multi-lingual in all languages in its data set. There just isn't a lot of reason for teams to spend the extra training time to build that sort of model.