| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by benpacker 868 days ago

Going to be sharing this snippet with non-technical friends and family:

“ SIMA agents trained on a set of nine 3D games from our portfolio significantly outperformed all specialized agents trained solely on each individual one. What’s more, an agent trained in all but one game performed nearly as well on that unseen game as an agent trained specifically on it, on average”

Many people I talk to assume that when a LLM gets something right, it’s because that specific thing was in the training set. Although the experience of human transfer learning is intuitive to people, I find people have a hard time appreciating that it can happen in algorithms too.

1 comments

YeGoblynQueenne 868 days ago

That's not right. If DeepMind's agents could really transfer what they learned from one game to another, that they've never seen before, their "specialized" agents, that only trained on one game, would then be able to perform well on unseen games. Instead, in order to get an agent with good performance in one unseen game they had to train it in all but that particular game.

That's typical of the poor generalisation displayed by neural nets and clearly not how humans do transfer learning.

link

utdiscant 868 days ago

But humans have already trained on an incredible number of games (including reality) when they play No Man's Sky for the first time. What they say here is that training on N-1 games makes you better at the Nth game. So you just continue to scale this up.

link

YeGoblynQueenne 868 days ago

"An incredible number of games"? You're saying a kid can't pick up and play No Man's Sky if it's the first time they ever played a video game? Or that they can't get good at it if it's the first game they play?

link