Hacker News new | ask | show | jobs
by onlyrealcuzzo 24 days ago
https://www.reddit.com/r/LocalLLaMA/comments/1gpr2p4/llms_co...

See Chart 13 here: https://www.rdworldonline.com/ais-great-compression-20-chart...

See here: https://epoch.ai/data-insights/llm-inference-price-trends

LLMs are so comically inefficient compared to the human brain that it is pretty easy to imagine this trend continuing for several more 90% drops.

If LeCun's JEPA or GRAM turn out to be a thing, we could see a 3-4 order of magnitude drop in a single release cycle / generation.

Keep in mind that performance per watt on the hardware side - at the same time - is still doubling every ~24 months - and this doesn't factor that in.