Hacker News new | ask | show | jobs
by sgillen 2876 days ago
>> Image processing and robotics controls are largely solved

Image processing and robotic control are very far from being solved problems. I guess you are saying that in the case of alpha go it would not be a super difficult step to have a camera and robotic hand physically move pieces around, and that's probably true. But I think in the DOTA case are new image processing challenges that interact with the AI in interesting ways.

I'm mostly talking about the need to move the game's camera around to gain more information. If you don't see your ally on your screen and need to see how they are handling a gank or something (full disclosure I don't play DOTA at all this could be a silly scenario). Then the AI would have to recognize this and move the camera to the allies location in order to gain that information. So really the novelty here would be in the network to somehow realize what information it needs and then further to learn how to gather that information. I honestly think that sounds like an extremely difficult next step.

3 comments

Human beings are restricted to what's on the screen, which includes not only the camera's perspective of the playfield but also, crucially, the minimap in the corner of the screen. Plus there's weird stuff like how you don't have perfect information about the health/mana of your teammates unless you hold down Alt... so yeah OpenAI is "cheating" somewhat and it would be really cool to see, once it evolves further, restrictions that allow it to better mimic human player capabilities.

That said, everything they've done so far is absolutely incredible (especially now that the AI can draft!!)

This is something i noticed; a human initiator would get counter initiated almost instantly, every single time by OpenAI. The blink dagger is much less effective. Pro humans do this too, but not every single time with perfect timing.

Humans dont concentrate on the whole screen, attention is directed...

It would be interesting to take this project up a few levels later on and see how it compares to direct API interaction.

I would love to see camera/mechanical interface like mentioned by others. Similarly, like you said humans don't focus on the whole screen. I would love to see how well the AI could perform if it was given something like blinders where only a small portion of the screen is in focus at any one time much like how human eyes work.

I believe human counter-initiation is only frame-perfect when the human is anticipating the initiation to happen (baiting).

Otherwise, you still have to add time to react plus mouse travel time.

There is actually an interesting paper on that, you can find it on the youtube channel two minute papers.

Basically, they let an AI loose on a simplified version of Quake's Capture the Flag. The AI processes game video output only and has learned several key strategies. The latest update has the AI with a winrate of 71% against top humans. Unlike the DOTA match the AI has no restricted reaction time.

The AI seems to be jittering the camera left and right to reconstruct a 3D image reliable from the screen which is aquite interesting way to compensate for lack of 3D vision (and the compensation our brain is capable of naturally to get a 3D intuition from a 2D image)