HN Mirror


	by vineethy 313 days ago
	Interesting twist on automated curriculum learning. This paper uses an LLM as both the environment and the policy, whereas other papers use LLMs for the policy or value fn. Would be cool to see other reward strategies that tie all these threads together.