Hacker News new | ask | show | jobs
by esafak 150 days ago
They are, in benchmarks. In practice Anthropic's models are ahead of where their benchmarks suggest.
1 comments

Bear in mind that lead may be, in large part, from the tooling rather than the model