Hacker News new | ask | show | jobs
by Jackobrien 2 hours ago
I see a world soon where there’s an extremely wide variety of small models for speculative decoding, unique to use cases, companies, and even individuals.
2 comments

Hopefully that is the case and hardware does not get impossible to get.
yes, heavily constrained by sophisticated guardrails.

this is definitely where things are going. the enormous "eat the world" models have extreme diminishing returns by comparison.