Hacker News new | ask | show | jobs
by craigacp 1021 days ago
There's a correction to that tweet, larger vocab means fewer tokens for any given sequence (usually, assuming it's not to add other languages or character sets).