	by lagmg05 624 days ago
	The question is if it solved the puzzle correctly before Norvig's article appeared. It could have been trained on the article or on HN comments (I am told in any Llama discussion that existing models can be modified and augmented). There could even be an added routine that special-cases trick questions and high-profile criticisms.

3 comments

Fripplebubby 624 days ago

While this is technically possible, it is not remotely practical and the downside risk of pushing out a borked model is much higher than the upside.

Training the model is expensive (obviously), but even if you are only training it slightly, running evaluations to determine whether the particular training checkpoint is at or above the quality bar is expensive, too.


Terretta 624 days ago

> The question is if it solved the puzzle correctly before Norvig's article appeared. It could have been trained...

This caught me by surprise. Is there a suggestion or evidence that, despite the "knowledge cutoff", OpenAI is continuously retraining GPT-4o's chat-backing model(s) on day-over-day updates to the web?


oli5679 624 days ago

Sure,

I guess the best way to test this is to compose a new question, of a similar format.


johnisgood 623 days ago

I am not sure "of a similar format" suffices here; the new question or riddle should not bear any resemblance or similarity to the original one.
