| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by RivieraKid 3040 days ago
	That doesn't make any sense unless I'm missing something, A0 is suited for a completely different problem than protein folding...

1 comments

rytill 3040 days ago

The AlphaZero algorithm (monte carlo tree search with value estimator trained by reinforcement learning) works on any environment you can simulate during play time, single player or not.

link

eutectic 3040 days ago

Any environment with finite action and state-spaces.

link

rytill 3028 days ago

No, the key requirement which makes it difficult to use on real-world tasks is that you must be able to do a forward rollout of your environment in your decision-making process.

link