|
|
|
|
|
by redhale
71 days ago
|
|
I agree with your take. I don't really see why evals are assumed to be exclusively in the domain of data scientists. In my experience SWEs-turned-AI Engineers are much better suited to building agents. Some struggle more than others, but "evals as automated tests" is, imo, so obvious a mental model, and can be so well adapted to by good SWEs, that data scientists have no real role on many "agent" projects. I'm not saying this is good or bad, just that it's what I'm observing in practice. For context, I'm a SWE-turned-AI Engineer, so I may be biased :) |
|