Hacker News new | ask | show | jobs
by benpacker 821 days ago
Going to be sharing this snippet with non-technical friends and family:

“ SIMA agents trained on a set of nine 3D games from our portfolio significantly outperformed all specialized agents trained solely on each individual one. What’s more, an agent trained in all but one game performed nearly as well on that unseen game as an agent trained specifically on it, on average”

Many people I talk to assume that when a LLM gets something right, it’s because that specific thing was in the training set. Although the experience of human transfer learning is intuitive to people, I find people have a hard time appreciating that it can happen in algorithms too.

1 comments

That's not right. If DeepMind's agents could really transfer what they learned from one game to another, that they've never seen before, their "specialized" agents, that only trained on one game, would then be able to perform well on unseen games. Instead, in order to get an agent with good performance in one unseen game they had to train it in all but that particular game.

That's typical of the poor generalisation displayed by neural nets and clearly not how humans do transfer learning.

But humans have already trained on an incredible number of games (including reality) when they play No Man's Sky for the first time. What they say here is that training on N-1 games makes you better at the Nth game. So you just continue to scale this up.
"An incredible number of games"? You're saying a kid can't pick up and play No Man's Sky if it's the first time they ever played a video game? Or that they can't get good at it if it's the first game they play?