Hacker News new | ask | show | jobs
by JacobAsmuth 15 days ago
The plural of anecdote is not data. What are your evals telling you?
1 comments

The % of accepted, actionable prompts is not up if I use Opus 4.7/4.6/4.8 if that is what you are asking.