Hacker News
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult (simonwillison.net)
1 point by gingersnap 201 days ago | 1 comment
ChrisArchitect 201 days ago
More discussion:
https://news.ycombinator.com/item?id=46037637