I think I can wrap my head around neural nets being superior at games with perfect information like chess or go. But how would you teach bluffing and randomness to a neural net?
Top professionals are building their strategy around game theory. They'll attempt to play in such a way that they aren't exploitable and look to deviate when they've spotted a weakness in their opponent's play.
Basically, the game theory optimal strategy is unexploitable. In every situation, the best you can do is break even by also playing the optimal strategy. If you deviate from optimal strategy, the optimal strategy will beat you, but it's possible that a strategy tailored to taking advantage of your specific deviations would beat you more quickly.
Unexploitable play typically means that you bet a size with a range of holdings that would make your opponent indifferent to all of his options (And the converse is true when facing a bet). For humans, this means that they gravitate to a few standard bet sizes, while a computer could, in theory, balance their range with much more granularity.
Last I read, for training the neural net it'll play billions+ hands against versions of itself designed to exploit various weaknesses. It'll start out by performing random actions, for example, say it'll have a 33% chance to call your bet, raise, or fold. It then starts to see that it does better when it raises your bet with the nuts and also as a bluff. Eventually, it arrives at an equilibrium strategy.
Since computers are much better at randomness than humans are, they're able to more effectively play these types of strategies and with more complexity of bet sizing. There is what's called a mixed strategy, a strategy where given a situation with the same hole cards you will call, raise, or fold to a bet with some non-zero probability. Doing that as a human is very difficult, but it's something computers manage to do quite easily.
Note that professional in this context just means made a living through poker. There are a ton of people like myself that earned a good living playing online or in casinos. A small minority received long-term sponsorships.
1. I started playing in high school and college. Like everything else I do I dove in to get better and eventually I was good enough that I was making good money playing online. At that point I started to pursue things full time.
2. For me it was the latter. The very top players in skill and visibility are typically the ones getting sponsorships. Win a big tournament, win at the highest level of online games, or make a tv appearance at a final table and you'll find opportunities for sponsorships. For most players, these sponsorships really just add stability to their income with the bulk still coming from winnings. Though there are the poker personalities who make the bulk of their income from sponsorships and TV deals.
3. You can definitely do it part time at some levels, but to keep up it does require quite a bit of study time to win at meaningful stakes.
4. When the US legal landscape changed in 2011, getting your money out of the sites that were still willing to serve Americans got more challenging. Moving out of the country wasn't an option for me. With the game getting harder all the time due to the proliferation of good strategy material and botting, it seemed like it was time to move on.
So with the online games, etc. is the point just that you are better than the next guy to join the game so you just end up winning against players that are not as good? How reliable is something like that? It's true that a sucker is born every minute, but is it really feasible to make a living off it vs just staying income-neutral (e.g. win some, lose some, not really make any profit)?
It's fairly reliable. Due to the random nature of poker, it's very easy to delude yourself into thinking that you're a better player than you are. This is true for hobbyists, gamblers, and professionals alike. Professional players may move up in stakes or even stay at the same stakes and think that they're stronger than the competition and write off their bad performance as a run of bad luck. Runs of bad luck (and good) can significantly skew your results even over a hundred thousand hands.
Because of this, players of all skill levels will play a long time in games where they are not favored. This is part of why it's so important to keep studying both your play and that of your opponents. If you rest on your laurels, those that do study will surpass you and you'll find yourself playing at a disadvantage without realizing it.
Online you're really able to make a good amount off even a small edge, as measured in big blinds per 100 hands, because hands are dealt so quickly and you can play more than one table at a time. For most of my career I was playing ~1000 hands/hour. In reality due to the rake, the cut the site takes from each pot, you need to have a substantial edge to make money, but this isn't as hard to achieve as you might think.
The main goal of any poker game is to win money. You win money from your opponents, you don't play agains the house. Although , the house (poker site) does take a small percentage of every pot you win, this is called rake. Do note, that in some games, rake can be so high that it is not profitable to play the game even if you are much better than your opponents. The only way to win money on the long run in poker is to be better than your opponents, so you need to look for bad opponents, it is a well know strategy called table selection or "bum hunting" :)
Nowdays it is still very feasible to make a living off poker, but you will have better chance finding bad player in live games.
Not who you're replying to, but I can give a few answers, having also played professionally for some time.
1. Lots of means this can happen. "Professional poker player" is generally taken to mean "derives primary source of income from playing poker" or sometimes "spends the majority of their time playing poker". There's no exact qualification.
2. Not generally sponsorship driven, although there are some modes of sponsorship that do factor in in some ways - the primary means of income is basically by winning money from other players. A secondary component is often rakeback, in online play. In cash games, you can join and leave at any time and your winnings or losses are simply the amount you are up or down in that particular session. In tourmanent play there is a payout structure based on your placement in the tournament (often something like 25% of the total prize pool to 1st, 15% to second, etc.)
3. It can be done part time. There is nothing to say that your bankroll can't be seeded or supplemented by external means, and you can play cash games for short or long periods. Tournaments are typically long (at least if you remain in them a long time, though you can be eliminated at any time, basically). I used to play approximately 40 hours a week (since I was treating it like a job), but now I play 10-15 and it represents about 30-40% of my yearly income.
4. I personally quit because I found the stress associated with having it as the sole means of providing myself too overwhelming. One can have a stretch of negative earnings that can last hours, days, weeks, even months - and it can be psychologically damaging in some ways. I also found that I preferred to keep it as a hobby than a means of income, since I enjoyed it more that way.
We know that there exists an optimal strategy, but that we still aren't close to achieving it. It's a zero sum game where both parties have the same lack of information and the betting order rotates. I think it has to have an optimal strategy.
At higher levels poker is about game theory, for example, the player bluffs at an optimal frequency in a certain situation so as to be indifferent to whether the opponent calls or folds.
Exploitative strategies, based on understanding opponent weaknesses and tendencies will win $ at a higher rate, but are themselves exploitable.
For example, almost never bluffing and playing only strong cards crushes beginners who play too many hands and call too much.
This strategy is easily beaten though by stealing most pots and then not paying off the infrequent big bets (strong hands don't come often enough).
A "perfect" game theory strategy is like armor, slowly bleeding the opponent every time they deviate from perfection themselves.
not sure if that helps but maybe some seeds to google at least
> This strategy is easily beaten though by stealing most pots and then not paying off the infrequent big bets (strong hands don't come often enough).
To dig deeper:
Or you can try actively punishing the big hands by folding out early. Of course, that strategy opens you up to being bled by your opponent bluffing strong hands. Attempting to actively punish the big hands here is a deviation. This is what jspiral means by "deviations from perfection".
I imagine it's mostly just playing the percentages. Bet when it has a high percentage of winning, fold when it doesn't. It doesn't need to read its opponents if it can play the percentages perfectly.
No, this is a terrible idea. If you play like this consistently, you are basically telling your opponents when to play against you and when to get out of the way. They can even pick bet sizes (assuming no limit) to refine your possible hands very accurately.
I'm usually surprised in tournaments by the number of people willing to play with me after I've sat quietly folding for the first three rounds... By all means, everyone should fold and give me the blinds, but there always seems to be a player or two who bite!
This is a very naive interpretation of the game of poker. Professional human players are already very good at calculating the percentages, and any joe playing from his computer has access to a calculator.
The reason why simply playing the numbers fails is that if I know an opponent is playing this way, I'll just fold every time he decides to play.
This is true for Limit Hold'em, but very much not true for No Limit. Limit Hold'em is a solved game, because as long as you're playing the odds, you can play perfectly. No Limit changes things because the bets can vary wildly. If you play a tight game (just play the odds), and opponent will get out whenever you're in, and will bluff just to see if you call or fold.
Bluffing is a major component in No Limit, and there are very different profitable playing strategies.
I fell to this when I randomly decided to play some limit hold'em one night. Kept losing to a guy that chased every chance at odds he could, because I couldn't make bets big enough to scare him out. Lesson learned!
That's not true at all for limit, the perfect bot's bluff frequency is probably higher than most humans. Play against it yourself here http://poker.srv.ualberta.ca
Top professionals are building their strategy around game theory. They'll attempt to play in such a way that they aren't exploitable and look to deviate when they've spotted a weakness in their opponent's play.
Basically, the game theory optimal strategy is unexploitable. In every situation, the best you can do is break even by also playing the optimal strategy. If you deviate from optimal strategy, the optimal strategy will beat you, but it's possible that a strategy tailored to taking advantage of your specific deviations would beat you more quickly.
Unexploitable play typically means that you bet a size with a range of holdings that would make your opponent indifferent to all of his options (And the converse is true when facing a bet). For humans, this means that they gravitate to a few standard bet sizes, while a computer could, in theory, balance their range with much more granularity.
Last I read, for training the neural net it'll play billions+ hands against versions of itself designed to exploit various weaknesses. It'll start out by performing random actions, for example, say it'll have a 33% chance to call your bet, raise, or fold. It then starts to see that it does better when it raises your bet with the nuts and also as a bluff. Eventually, it arrives at an equilibrium strategy.
Since computers are much better at randomness than humans are, they're able to more effectively play these types of strategies and with more complexity of bet sizing. There is what's called a mixed strategy, a strategy where given a situation with the same hole cards you will call, raise, or fold to a bet with some non-zero probability. Doing that as a human is very difficult, but it's something computers manage to do quite easily.