|
|
|
|
|
by mailswept_dev
289 days ago
|
|
Totally agree with this — especially the part about end-to-end evals.
I’ve seen too many teams rely only on manual testing and miss obvious regressions.
Checkpoints + lightweight e2e evals feel like the sweet spot before things get too costly. |
|