| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by movedx01 61 days ago
	Probably the same way other models learned to surpass human ability while being bootstrapped from human-level data - using reinforcement learning. The question is, do we have good enough feedback loops for that, and if not, are we going to find them? I would bet they will be found for a lot of use cases.