| HN Mirror



	by BearOso 22 days ago
	Unless there's a new paradigm, scaling up is all they can do to improve performance. They've shrunk down all the way to 1-bit models and all the low-hanging fruit is gone. There's no way for them to get much smaller, so they have to get bigger and faster to meet expectations.

2 comments

intelkishan 22 days ago

This hasn’t been true for the past 2 years


oblio 22 days ago

Is this based on an assumption that Opus 4.7 & co are equivalent to or smaller than Opus 4.5 & co? I highly doubt the advanced models (Opus, Pro, etc.) aren't bigger than the standard ones (Sonnet, Flash, etc.), and I'm fairly sure newer models are bigger than older ones.


eldenring 22 days ago

This is just not true at all. There are massive leaps from algorithms, data, etc. every year. Scale is one axis of many, and you need to get them all correct.


BearOso 21 days ago

What novel data hasn't already been used in training? What new algorithms are there? Can you post some links so we can read about them?
