Y
Hacker News
new
|
ask
|
show
|
jobs
by
sanderjd
1 day ago
I mean, in my experience some of this stuff is way closer to table stakes things than that. Like "the tool call didn't get totally confused" more than "did the communication with the user feel good".