|
|
|
|
|
by simonw
358 days ago
|
|
I think the key part if that advice is the without evidence bit: > I suggest not thinking of switching model as the main axes of how to improve your system off the bat without evidence. If you try to fix problems by switching from eg Gemini 2.5 Flash to OpenAI o3 but you don't have any evals in place how will you tell if the model switch actually helped? |
|
Of course, Hamel is right too. In the long run, people will need to take more scientific approach. They already do, if inference costs are the main concern.