Hacker News
by xrisk 209 days ago
Maybe this is explainable by the fact that these tests are part of the LLM's training set?