| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by singularity2001 109 days ago
	Superhuman chess engines are now trained just from one bit reward signal: win / lose. This says absolutely nothing about the complexity that the model develops inside. They even learned the rule of the games just from that reward.