Hacker News new | ask | show | jobs
by drra 69 days ago
Absolutely. Anyone working on inference token level knows how wasteful it all is especially in multimodal tokens.