Hacker News
by kenhwang 2840 days ago
Not even just late game; they overcommitted at every point in the game. Those early-game tower dives for trades rarely led to an objective or advantage. They were able to trade early because mechanical skill matters more in the early game.

By the time midgame rolled around, it was pretty clear how naive their strategy was. It has an element of surprise to it since it's not a very human strategy, but just because it's not human doesn't make it remotely good.

It's like watching a car drive on a sidewalk in reverse, uphill, honking to avoid pedestrians. It's very impressive that the car figured out that driving on sidewalks reduces collisions with other vehicles and that honking reduces the chance of hitting pedestrians, and that it's doing all of this while driving in reverse, which is very hard for a human to do. But no one in their right mind would call that good driving.

1 comment

That's a great example. It would be great if you could write a blog post on OpenAI Five. There's a LOT of misinformation about it, and the topic could use a treatment like this: https://www.alexirpan.com/2018/02/14/rl-hard.html