by RivieraKid 3040 days ago
That doesn't make any sense unless I'm missing something. A0 is suited for a completely different problem than protein folding...

The AlphaZero algorithm (Monte Carlo tree search with a value estimator trained by reinforcement learning) works on any environment you can simulate at play time, single-player or not.
Any environment with finite action and state spaces.
No, the key requirement which makes it difficult to use on real-world tasks is that you must be able to do a forward rollout of your environment in your decision-making process.
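To make the forward-rollout point concrete, here is a rough sketch, not AlphaZero itself. It uses flat Monte Carlo search (random rollouts from each candidate action, with no tree and no learned value estimator) on a made-up single-player toy environment. The names `step`, `rollout`, `plan` and the line-walk environment are all assumptions for illustration. The thing to notice is that `plan` has to call `step` on states the agent has not actually visited, which is exactly what you cannot do without a simulator.

```python
import random
from typing import Tuple

ACTIONS = [-1, +1]  # hypothetical toy action set: step left or step right


def step(state: int, action: int) -> Tuple[int, float, bool]:
    """Hypothetical simulator: walk on a line, reward 1.0 for reaching +5.

    Returns (next_state, reward, done). The episode ends at +5 or -5.
    """
    nxt = state + action
    if nxt >= 5:
        return nxt, 1.0, True
    if nxt <= -5:
        return nxt, 0.0, True
    return nxt, 0.0, False


def rollout(state: int, rng: random.Random, max_steps: int = 50) -> float:
    """Play random actions forward in the simulator and return the total reward."""
    total = 0.0
    for _ in range(max_steps):
        state, reward, done = step(state, rng.choice(ACTIONS))
        total += reward
        if done:
            break
    return total


def plan(state: int, n_rollouts: int = 200, seed: int = 0) -> int:
    """Pick the action whose simulated rollouts have the highest mean return."""
    rng = random.Random(seed)
    best_action, best_value = ACTIONS[0], float("-inf")
    for action in ACTIONS:
        # The planner queries the simulator here, before acting for real.
        nxt, reward, done = step(state, action)
        total = 0.0
        for _ in range(n_rollouts):
            total += reward if done else reward + rollout(nxt, rng)
        value = total / n_rollouts
        if value > best_value:
            best_action, best_value = action, value
    return best_action


if __name__ == "__main__":
    print(plan(4))
```

If `step` were a real-world process instead of a function you can call freely, the loop inside `plan` would have nothing to query, which is the difficulty described above.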