Hacker News
by XCSme 2 hours ago
Also, Claude/Fable models are quite bad at instruction following:
https://artificialanalysis.ai/evaluations/ifbench