| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by namibj 693 days ago
	Are you offering to code that or donate compute for the RL training? The problem is mostly that it's fairly intensive to code an efficient RL trainer for this, and even then it's expensive to run the training.

1 comments

Maybe it could be done distributed, in a similar way to the Leela Zero open source replication of Alpha Zero.