Hacker News new | ask | show | jobs
by ai_slop_hater 26 days ago
Are “frameworks for evaluating the output of an agent” and "LLM evals" different? :) If yes, how?
1 comments

"LLM evals" is maybe an overused term because it can mean a bunch of things. This article talks about LLM-as-a-judge where an LLM scores another system's outputs.