Y
Hacker News
new
|
ask
|
show
|
jobs
Why LLM-as-judge fails for code evaluation. Here's what works.
(
navigara.medium.com
)
2 points
by
alienll
45 days ago