Hacker News
by
ai_slop_hater
26 days ago
Are "frameworks for evaluating the output of an agent" and "LLM evals" different? :) If yes, how?
1 comment
brianwmunz
26 days ago
"LLM evals" is maybe an overused term because it can mean a bunch of things. This article talks about LLM-as-a-judge, where an LLM scores another system's outputs.
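To make the LLM-as-a-judge idea concrete, here is a minimal sketch, not taken from the article. It assumes a hypothetical `call_judge` function that sends a prompt to some judge model and returns its text reply; the prompt wording, the 1 to 5 scale, and the `fake_judge` stand-in are all illustrative assumptions.

```python
import re
from typing import Callable

# Illustrative grading prompt; the wording and the 1-5 scale are assumptions.
JUDGE_PROMPT = (
    "You are grading an answer.\n"
    "Question: {question}\n"
    "Answer: {answer}\n"
    "Rate the answer from 1 (poor) to 5 (excellent). Reply with the number only."
)


def judge_output(question: str, answer: str, call_judge: Callable[[str], str]) -> int:
    """Score another system's answer using a judge model.

    `call_judge` is a hypothetical prompt -> reply function; in practice it
    would wrap whatever model API you use.
    """
    reply = call_judge(JUDGE_PROMPT.format(question=question, answer=answer))
    # Pull the first standalone digit in the range 1-5 out of the reply.
    match = re.search(r"\b[1-5]\b", reply)
    if match is None:
        raise ValueError(f"judge reply contained no score: {reply!r}")
    return int(match.group())


def fake_judge(prompt: str) -> str:
    # Stand-in for a real model call so the sketch runs offline.
    return "4"


score = judge_output("What is 2 + 2?", "4", fake_judge)
```

The system being scored could be an agent, a RAG pipeline, or a single prompt; the judge only sees its output, which is why the same pattern gets filed under both "agent evaluation" and "LLM evals".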