Y
Hacker News
new
|
ask
|
show
|
jobs
by
sghiassy
138 days ago
N00b Question - how do you measure performance for AI agents like the way they did in this article? Are there frameworks to support this type of work?