| HN Mirror

	by Der_Einzige 574 days ago
	Everyone keeps claiming this, but we have zero evidence of any kind of scaling wall whatsoever. Oh, you mean data? Synthetic Data, Agents, and Digitization solve that.

anon373839 574 days ago

I disagree, but I also wasn’t referring to the exhaustion of training materials. I am referring to the fact that exponentially more compute is required to achieve linear gains in performance. At some point, it just won’t be feasible to do $50B training runs, you know?
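
To make the "exponentially more compute for linear gains" point concrete, here is a minimal sketch of the arithmetic. It assumes a log-linear relationship between compute and a benchmark score. The constants `BASE_SCORE` and `POINTS_PER_DECADE` are hypothetical, chosen only for illustration and not fitted to any real model.

```python
import math

# Hypothetical constants for illustration only (not fitted to any real model).
BASE_SCORE = 0.0          # score at 1 unit of compute
POINTS_PER_DECADE = 10.0  # score gained per 10x increase in compute


def score(compute: float) -> float:
    """Score under an assumed log-linear scaling law."""
    return BASE_SCORE + POINTS_PER_DECADE * math.log10(compute)


def compute_for(target_score: float) -> float:
    """Invert the assumed law: compute needed to reach a target score."""
    return 10 ** ((target_score - BASE_SCORE) / POINTS_PER_DECADE)


if __name__ == "__main__":
    # Equal steps in score require a constant multiple of compute each time.
    for target in (10, 20, 30, 40):
        print(f"score {target}: {compute_for(target):,.0f} units of compute")
```

Under these assumptions, every additional 10 points costs 10x the compute of the previous step, so the budget grows geometrically while the score grows linearly.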

throw5959 574 days ago

$50B still seems reasonable compared to the revenue of the big AI companies.

mentalgear 574 days ago

What revenues? If by big AI companies you mean LLM service providers (OpenAI, ...), their revenues are far from high or profitable. https://www.cnbc.com/2024/09/27/openai-sees-5-billion-loss-t...

Maybe Nvidia, but they are a chip/hardware maker first. And even for them, a $50B training run with no exponential gains seems unreasonable.

Better to optimize the architecture/approach first, which is also what most companies are doing now before scaling out.

throw5959 573 days ago

It's not unusual to make infrastructure investments that will pay off in 30-50 years. I don't see why not an AI model - unless it's not true that we're at the end of scaling.

cubefox 574 days ago

There were multiple reports confirming that OpenAI's Orion (planned to be GPT-5) yielded unexpectedly weak results.

pegasus 573 days ago

And not just OpenAI is facing this problem. Anthropic and Google as well.

Der_Einzige 573 days ago

So DeepSeek V3 did nothing to show you how wrong this take is?

UltraSane 573 days ago

And costs $500 million per training run.

UltraSane 573 days ago

There seems to be an affordability scaling wall.
