|
|
|
|
|
by 0xDEAFBEAD
414 days ago
|
|
Are there any of these tools which will use your evals to automatically recommend a model to use? Imagine if you didn't need to follow model releases anymore, and you just had a heuristic that would automatically select the right price/performance tradeoff. Maybe there's even a way to route queries differently to more expensive models depending on how tricky they are. (This would be more for using models at scale in production as opposed to individual use for code authoring etc.) |
|
What we've seen most successful is making recommendations in the agent creation process for a given tool/workload and then leaving them somewhat static after creation.