Hacker News
LLM-as-Judge: Evaluating and Improving Language Model Performance in Production (segment.com)
2 points by n2parko 775 days ago