Evolving Curricula with Regret-Based Environment Design


	Evolving Curricula with Regret-Based Environment Design (accelagent.github.io)
	14 points by minch 1565 days ago


aiaffinity 1565 days ago

Very interesting work and an amazing demo! Btw, a very basic question: does the demo show any curriculum or training process, or just the final policies learned by each of the four competing approaches?

(PS: I co-wrote a short, elementary paper on automatic curriculum design for RL in 2017 [0].)

[0]: https://arxiv.org/abs/1703.07853


minch 1564 days ago

Thanks! The demo just shows the final agents after training (30K gradient updates). Interesting work on reward-maximizing curricula. I had not seen it before, so thanks for the pointer.
