| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by antirez 508 days ago
	LLMs using CoT are also decoder-only, it's not a paradigm shift like people want to claim now to don't say they were wrong: it's still next token prediction, that is forced to explore more possibilities in the space it contains. And with R1-Zero we also know that LLMs can train themselves to do so.