Hacker News
by nojvek 954 days ago
I believe AlphaZero and MuZero would fit that definition.

Apart from an environment that lets them play, they need nothing else: they become superhuman purely through self-play.

A benchmark for any AGI system is how fast it can learn from sparse, unlabeled data and generalize.