Hacker News new | ask | show | jobs
LLM Position Bias Benchmark: Swapped-Order Pairwise Judging (github.com)
1 points by zone411 59 days ago