|
|
|
|
|
by lagmg05
624 days ago
|
|
The question is if it solved the puzzle correctly before Norvig's article appeared. It could have been trained (I am told that existing models can be modified and augmented in any Llama discussion) on the article or on HN comments. There could even be an added routine that special cases trick questions and high profile criticisms. |
|
Training the model is expensive (obviously), but even if you are only training it slightly, running evaluations to determine whether the particular training checkpoint is at or above the quality bar is expensive, too.