Hacker News new | ask | show | jobs
by infgeoax 731 days ago
Hmm, but is it really "generalizing" or just pulling information from the training data? I think that's what this benchmark is really about: to adapt to something it has never seen before quickly.