Hacker News new | ask | show | jobs
by Kuinox 954 days ago
tldr: GPT-4 Turbo have worse score on synthetic benchmark of the first attempt because they speculate it's a smaller model, and isn't able to memorize as well the response.