Hacker News new | ask | show | jobs
by tintor 1284 days ago
Improve itself through experimentation with reinforcement learning. This is how humans improve too. AlphaZero does it.
1 comments

The amount of work in that area of research is substantial. You will see world shattering results in a few years.

Current SOTA: https://openai.com/blog/vpt/