| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ai_slop_hater 26 days ago
	Are “frameworks for evaluating the output of an agent” and "LLM evals" different? :) If yes, how?

1 comments

brianwmunz 26 days ago

"LLM evals" is maybe an overused term because it can mean a bunch of things. This article talks about LLM-as-a-judge where an LLM scores another system's outputs.

link