| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by rdedev 1133 days ago
	It's pretty straightforward to build an RL environment for closed systems like chess but I don't think it's close enough for an AGI to learn. Like RLHF uses human feedback. Unless we come up with a way to scale that process AGI by this year doesn't seem possible