Hacker News new | ask | show | jobs
by namibj 693 days ago
Are you offering to code that or donate compute for the RL training?

The problem is mostly that it's fairly intensive to code an efficient RL trainer for this, and even then it's expensive to run the training.

1 comments

Maybe it could be done distributed, in a similar way to the Leela Zero open source replication of Alpha Zero.