|
|
|
|
|
by gdb
2920 days ago
|
|
To be clear: - The 1v1 bot played at The International used a special creep block reward (and a big if statement separating that part of the agent from the self-play trained part). It trained for two weeks. - A 2v2 bot discovered creep blocking on its own, no special reward. It trained for four weeks. - OpenAI Five does not have a creep blocking reward, but neither (to our knowledge) does it creep block currently. Trained for 19 days! |
|