Y
Hacker News
new
|
ask
|
show
|
jobs
by
suprfnk
35 days ago
But then, if an agent picks the best response, how would you know that
that
is reliable?
2 comments
xienze
35 days ago
Obviously you have multiple agents justify why they picked a certain response and then create another agent that picks the solution with the best justification.
link
kkyr
35 days ago
touché
link
onion2k
35 days ago
You could get the agents to output something structured and then use a deterministic test if you're worried about that.
link