Hacker News
by tintor 1284 days ago
Improve itself through experimentation with reinforcement learning. This is how humans improve too. AlphaZero does it.
1 comment
lostmsu 1284 days ago
The amount of work in that area of research is substantial. You will see world-shattering results in a few years.
Current SOTA:
https://openai.com/blog/vpt/