Hacker News new | ask | show | jobs
by bpshaver 590 days ago
Section 6, "Controlled Evaluation," answers that question: https://arxiv.org/pdf/2304.03442