Hacker News new | ask | show | jobs
by mod 1800 days ago
You've picked two of the easier games to solve--heads up games. More specifically, heads up games with specific stack depth

There is no AI that can play well, for instance, in a 9-handed game with varying stack sizes, while itself and some competent players are 600BB deep and some other players are 40bb deep.

It takes a specific, narrow ruleset to tailor an AI to be able to play it at such a high level. Or more compute than we currently have in real-time.

Pluribus' matches against pros had each hand reset to 100BB.

Also, notably, the bot can still lose in the short term to terrible players, which was the thrust of my post. In fact, given its bluffing frequency, it might actually do worse against weaker players than it did against pros. Additionally, no human can realistically implement an AI strategy, meaning the AI is not a big detriment to the actual game of poker, as it's currently played.

2 comments

>Also, notably, the bot can still lose in the short term to terrible players, which was the thrust of my post. In fact, given its bluffing frequency, it might actually do worse against weaker players than it did against pros.

To be clear about why this might be true for someone who doesn't know much of poker strategy--the AI is (I presume) going to optimize for strategies that can't be beaten by changing one's own distribution of plays (for (simplified) example, if the AI raises a given hand 20% of the time and folds 80%, it's doing so because even if opponent were to call 100%, fold 100%, or something in-between; it wouldn't change the overall expected value of that hand with that distribution of plays). Professional human players, on the other hand, will absolutely cotton onto a player whose distribution of plays is not optimal and can therefore be exploited for profit.

Yes, and bluffing bad players generally doesn't work out as well as bluffing good players.

The AI notoriously bluffs a lot, which many bad player types accidentally counter by calling too often.

Good human players will also specifically exploit bad players, as you note, while the AI attempts instead to itself be unexploitable.

Cepheus doesn't care about "stack depth". It's just brutal statistics, this strategy beats your strategy unless your strategy is also approximately perfect regardless of what strategy you have. Good, bad, newbie, pro - it doesn't matter.

Yes, if it's very chip poor, Cepheus could run out of chips before it stops being unlucky and takes your money, but that's unavoidably true anyway in Poker due to it being a game of chance as well as skill.

Cepheus is actually a good Rorschach test for players who say "GTO" a lot. If you actually grasp what game theory optimality is about, you see Cepheus and you're like "OK, I'm done playing Heads Up Limit" (and indeed although it was once somewhat common you won't find any professionals playing this for some years now). But if you're the sort of person for whom it's just another phrase to sling around, like complaining you had a "bad beat" when you were never better than evens, chances are you see Cepheus and you think "I could beat that". Oh dear.

> Also, notably, the bot can still lose in the short term to terrible players, which was the thrust of my post.

This is exactly the useless observation I'm complaining about. Poker is a game of chance so of course anybody might get lucky in the very short term. If what you value is the fact you could win despite being terrible at it, just play roulette and save everybody else the trouble.

> In fact, given its bluffing frequency, it might actually do worse against weaker players than it did against pros.

It isn't about "doing worse" here and that's yet more of the mindset that got you into trouble here. The machine doesn't care about "doing worse" it just wins statistically over time. A grinder needs to make rent and buy groceries, so for them being up $50 after a full day feels like losing, it is losing. But the machine doesn't need money.

Again, if it's just about being able to get lucky and get some sort of temporary "High" from that, I'd commend roulette over poker, or, if you like cards, try blackjack.

No, there aren't any academics showing off bots that play full ring. Of course, there weren't any academics showing off that they could count Blackjack. At least not until after they were run out of Vegas with a fortune of the casino's money... Grinding online full ring with AI is not academically interesting, but it would be profitable. The former would cause you to publish (like Cepheus) the latter not so much.