| HN Mirror

	by zozbot234 5 hours ago
	Among other things, because you simply can't get those "massive amounts" of text from a SOTA model at reasonable cost. And complex reasoning cannot possibly be trained in a pure one-shot fashion; real post-training takes massive resources. The whole story doesn't add up.