Hacker News new | ask | show | jobs
Why LLM-as-judge fails for code evaluation. Here's what works. (navigara.medium.com)
2 points by alienll 45 days ago