Hacker News
by kelipso 312 days ago
A setup tuned specifically for the benchmark is just plain cheating, not Goodhart’s law.