| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by sota_pop 411 days ago
	Yes, that does sound very similar. To my knowledge, isn’t that (effectively) how the latest DeepSeek breakthroughs were made? (i.e. by leveraging chatgpt outputs to provide feedback for training the likes of R1)