Show HN: RS-Poker V5 The one with self learning multi-threaded async Agents (ottercrew.group)
3 points by eclark 2 days ago
rs-poker has been my passion project for a long time. This release is a big one: as far as I know, it's the first open source poker bot/agent implementation to combine all of these state-of-the-art techniques:

- tokio-based async exploration
- Rust slab-allocation-based tree structure for regret minimization
- perfect hashing for faster hand ranking
- a TUI via https://ratatui.rs/
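For anyone unfamiliar with the slab-style layout, here is a minimal sketch of the idea. It is not the rs-poker code: `Node`, `Tree`, `add_node`, `add_child`, and `strategy` are hypothetical names, and a plain `Vec` stands in for a real slab allocator. Every node lives in one contiguous buffer, children are referenced by index instead of by pointer, and each node carries the cumulative regrets that regret matching turns into a strategy.

```rust
/// Hypothetical node: cumulative regret per action, plus child indices into the arena.
#[derive(Debug, Clone)]
struct Node {
    regrets: Vec<f32>,
    children: Vec<Option<usize>>,
}

/// Arena that keeps every node in one Vec; indices replace pointers.
/// (A stand-in for a slab allocator, used here to stay dependency-free.)
#[derive(Debug, Default)]
struct Tree {
    nodes: Vec<Node>,
}

impl Tree {
    /// Allocate a node with `num_actions` actions and return its index.
    fn add_node(&mut self, num_actions: usize) -> usize {
        self.nodes.push(Node {
            regrets: vec![0.0; num_actions],
            children: vec![None; num_actions],
        });
        self.nodes.len() - 1
    }

    /// Allocate a child and link it under `parent` for the given action.
    fn add_child(&mut self, parent: usize, action: usize, num_actions: usize) -> usize {
        let child = self.add_node(num_actions);
        self.nodes[parent].children[action] = Some(child);
        child
    }

    /// Regret matching: play each action in proportion to its positive regret,
    /// falling back to a uniform strategy when no regret is positive.
    fn strategy(&self, node: usize) -> Vec<f32> {
        let regrets = &self.nodes[node].regrets;
        let positive: Vec<f32> = regrets.iter().map(|r| r.max(0.0)).collect();
        let total: f32 = positive.iter().sum();
        if total > 0.0 {
            positive.iter().map(|p| p / total).collect()
        } else {
            vec![1.0 / regrets.len() as f32; regrets.len()]
        }
    }
}
```

Storing indices rather than boxed pointers keeps the nodes packed together in memory, which is the property that matters when many threads walk the tree.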

Creating your own poker bot and having it compete in an arena should take fewer than 100 lines of code: https://docs.rs/rs_poker/latest/rs_poker/arena/index.html

I need more eyes on the implementation, and more attempts to make the algorithms and agents state of the art. I know I can't have found the optimal configurations and algorithms; I'd love for the open source community to prove me wrong.

There's one glaring limitation that I need to fix. Right now the CFR agents can't predict their opponents' hands, so all regret minimization is done against either the exact hand (pretty conservative) or a random hand (too wide). I have some ideas here, but I need more data and more discussion.

Direct GitHub link: https://github.com/elliottneilclark/rs-poker

1 comment

Is this Pluribus?
No, this doesn't currently use a model for valuation. It uses recursive exploration (with Stockfish-like early termination) and keeps track of the reward and regret at each node. By brute-forcing across CPU cores and using a compact memory layout, I was able to get reasonable convergence with 6-way rings.
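To make the exploration loop concrete, here is a rough single-player sketch. It is a simplification rather than the rs-poker implementation or full CFR: there are no opponents or reach probabilities, and it assumes the early termination is a depth cutoff that falls back on a stored estimate. `Node`, `leaf`, `decision`, and `explore` are hypothetical names.

```rust
/// Hypothetical tree node for a toy single-player game.
#[derive(Debug)]
struct Node {
    /// Terminal payoff, or the estimate returned when the depth cutoff is hit.
    value: f64,
    children: Vec<Node>,
    /// Cumulative regret per action (one entry per child).
    regret: Vec<f64>,
}

impl Node {
    fn leaf(value: f64) -> Self {
        Node { value, children: vec![], regret: vec![] }
    }

    fn decision(estimate: f64, children: Vec<Node>) -> Self {
        let n = children.len();
        Node { value: estimate, children, regret: vec![0.0; n] }
    }

    /// Regret matching: weight actions by positive regret, uniform otherwise.
    fn strategy(&self) -> Vec<f64> {
        let positive: Vec<f64> = self.regret.iter().map(|r| r.max(0.0)).collect();
        let total: f64 = positive.iter().sum();
        if total > 0.0 {
            positive.iter().map(|p| p / total).collect()
        } else {
            vec![1.0 / self.regret.len() as f64; self.regret.len()]
        }
    }
}

/// One exploration pass. Returns the expected reward under the current
/// strategy and accumulates regret at every decision node it visits.
fn explore(node: &mut Node, depth: usize) -> f64 {
    // Stop at terminal nodes, or early when the depth budget runs out.
    if node.children.is_empty() || depth == 0 {
        return node.value;
    }
    let strategy = node.strategy();
    let mut action_values = Vec::with_capacity(node.children.len());
    for child in node.children.iter_mut() {
        action_values.push(explore(child, depth - 1));
    }
    let node_value: f64 = strategy
        .iter()
        .zip(&action_values)
        .map(|(p, v)| p * v)
        .sum();
    // Regret of an action = what it would have earned minus what we expected.
    for (r, v) in node.regret.iter_mut().zip(&action_values) {
        *r += v - node_value;
    }
    node_value
}
```

Calling `explore` repeatedly on the same root shifts the strategy toward the higher-paying actions, which is the convergence behavior described above in miniature.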

I do have some plans to release a follow-on with some ML components à la Pluribus.