Hacker News
by bfors 463 days ago
Perhaps they already evaluated their LLM judge model (with another LLM).