Hacker News new | ask | show | jobs
by simpx 3223 days ago
I'm curious about how bot learn to creep block?

How does the bot understand the value of long-term strategy?

1 comments

It doesn't understand the value; from the article: "We also separately trained the initial creep block using traditional RL techniques".