by YetAnotherNick 146 days ago
Benchmarks like ARC-AGI are cheap to run, and their scores correlate strongly with price. I think it's very easy to prove that the models are degrading.