by raw_anon_1111
198 days ago
You need a way to test model changes regardless, since models within the same family change too. Is it really a heavier lift to test different model families than it is to test going from GPT 3.5 to GPT 5, or even to test changes to your own prompts?
Maybe another way of saying the same thing is that there is still a lot of work to do to make eval tooling much better!