| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by enahs-sf 53 days ago
	Occam’s razor tells me it’s probably because it’s not good. Perhaps running a company like survivor in a pressure cooker is not an effective management strategy.

2 comments

GoToRO 53 days ago

Also when you finally make it better, the others make theirs even better and you are still behind.

link

DonsDiscountGas 52 days ago

Seemed to work when it comes to selling ads. I'm thinking training LLMs is harder than anthropic and openai make it look

link

cyanydeez 52 days ago

I'm guessing both openai and anthropic have transitioned to prompt magic and fine tuning rather than try to keep building LLMs at scale. The fact that QWEN and other models are impressive, small and perfectly suitable for most work means every dollar you're spending on trying to train larger models is a losing prop.

link

vinni2 52 days ago

> every dollar you're spending on trying to train larger models is a losing prop

You probably don’t know how smaller models are trained then. Most of them are knowledge distilled or trained using data generated from larger models. If larger models are stopped there is no magical way smaller models will keep getting better.

link

cyanydeez 52 days ago

you're arguing with capitalism not science or engineering.

link

vinni2 51 days ago

Why don’t you argue with science and engineering then?

link