Hacker News new | ask | show | jobs
by eutectic 3429 days ago
Is there a version of CFR for differentiable learners like neural networks?
1 comments

The averaging part makes it sound like the usual RL self-play against regular checkpoints of oneself.