Hacker News
by thumbuddy 1063 days ago
You also have to remember that these models were trained on the study materials for all of those tasks. That doesn't cheapen the achievement, but it does mean it's not "emergent behavior". The model probably has half a billion weights dedicated to each of those exams.