|
|
|
|
|
by SJC_Hacker
339 days ago
|
|
> I actually think this “cheating” is fine. In fact it’s preferable. The thing with IMO, is the solutions are already known by someone. So suppose the model got the solutions beforehand, and fed them into the training model. Would that be an acceptable level of "cheating" in your view? |
|
Finally, even if you aligned the model with the answers its weight shift of such an enormous model would be inconsequential. You would need to prime the context or boost the weights. All this seems like absurd lengths to go to to cheat on this one thing rather than focusing your energies on actually improving model performance. The payout for OpenAI isn’t a gold medal in the IMO it’s having a model that can get a gold medal at IMO then selling it. But it has to actually be capable of doing what’s on the tin otherwise their customers will easily and rapidly discover this.
Sorry, I like tin foil as much as anyone else, but this doesn’t seem credibly likely given the incentive structure.