Hacker News new | ask | show | jobs
by pranshuchittora 29 days ago
Kinda similar, but this is token efficient. Each word is ~1 BPE token