| Just because it's newly created doesn't mean that the structure of the language and the concepts it represents are actually new. It's clear that whatever tests he writes cover well established and understood concepts. This is where I believe people are missing the point. GPT4 is not a general intelligence. It is a highly overfit model, but it's overfit to literally every piece of human knowledge. Language is humanities way of modelling real world concepts. So GPT is able to leverage the relationships we create through our language to real world concepts. It's just learned all language up until today. It's an incredible knowledge retrieval machine. It can even mimick how our language is used to conduct reasoning very well. It can't do this efficiently, nor can it actually stumble upon a new insight because it's not being exposed in real time to the real world. So, this professors 'new' test is not really new. It's just a test that fundamentally has already been modelled. |
You've managed to essentially say nothing of substance. So it passes because structure and concepts are similar. okay. are students preparing for tests working with alien concepts and structures then because i'm failing to see the big difference here.
A model isn't overfit because you've declared it so. and unless GPT-4 is several trillion parameters, general overfitting is severely unlikely. But i doubt you care about any of that. Can you devise a test to properly asses what you're asserting ?