Y
Hacker News
new
|
ask
|
show
|
jobs
by
alphabetting
842 days ago
Impressive benchmarks here. The 90% eval for one of the math categories on 0-shot vs 74.5% GPT-4 8-shot is nice.
https://twitter.com/AnthropicAI/status/1764653830468428150
1 comments
monkeydust
842 days ago
Nice. GPT-4 beater finally perhaps (until 5 is launched which I guess must be pretty soon now)
link