Hacker News new | ask | show | jobs
LLM as Judge: Reproducible Evaluation for LLM Systems (nemorize.com)
1 points by reverseblade2 61 days ago