| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jakobnicolaus 2385 days ago
	We are entirely focused on the self-play setting in which the goal is to learn the highest performing policy for a team of agents all trained together. The Hanabi Challenge also outlines an ad-hoc setting in which you need to adjust to the diverse policies of other agents in the team on the fly.