Hacker News new | ask | show | jobs
by XCSme 2 hours ago
Also Claude/Fable models are quite bad at instructions following: https://artificialanalysis.ai/evaluations/ifbench