by drakenot 427 days ago
The cost of inference[0] at a given quality level has been dropping by nearly 10x year over year. I’m not sure when that trend will slow down, but there has still been plenty of low-hanging fruit in algorithmic efficiency.

[0] https://www.reddit.com/r/LocalLLaMA/comments/1gpr2p4/llms_co...
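
To put that rate in perspective, here's a rough sketch of how a 10x yearly drop compounds. The starting price of $10 per million tokens is a made-up number for illustration, not a figure from the linked post, and `projected_cost` is just a helper name I picked.

```python
def projected_cost(start_cost, yearly_factor, years):
    """Cost after `years` if it falls by `yearly_factor` every year."""
    return start_cost / (yearly_factor ** years)


# Hypothetical starting price: $10 per million tokens (illustrative only).
START_COST = 10.0
YEARLY_FACTOR = 10  # the "nearly 10x year over year" rate from the comment

for year in range(4):
    cost = projected_cost(START_COST, YEARLY_FACTOR, year)
    print(f"year {year}: ${cost:.4f} per million tokens")
```

Even if the real factor ends up well below 10x, the compounding is what matters: three years at this rate is a 1000x reduction.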