| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by oersted 793 days ago

In the English category: GPT-4-0314 (ELO 1166), Llama 3 8B Instruct (ELO 1161), Mistral-Large-2402 (ELO 1151), GPT-4-0613 (ELO 1148).

You are right, I toned down the language, I got a bit overexcited, and I missed the difference in the versions of GPT-4. And LMSYS is a subjective benchmark for what users prefer, which I'm sure has weird inherent biases.

It's just that any signal of an 3.8B model being anywhere in the vicinity of GPT-4 is huge.

1 comments

moralestapia 793 days ago

Yeah, GPT3.5, in a phone, at ~1,000 tokens/sec ... nice!

link

mlyle 793 days ago

> at ~1,000 tokens/sec

12 tokens per second.

link

moralestapia 793 days ago

Whoops, made the same mistake as @ignoramous :P

link