| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by BoorishBears 593 days ago

Saying these models are at GPT-4 level is setting anyone who doesn't place special value on the local aspect up for disappointment.

Some people do place value on running locally, and I'm not against then for it, but realistically no 70B class model has the amount of general knowledge or understanding of nuance as any recent GPT-4 checkpoint.

That being said these models are still very strong compared to what we had a year ago and capable of useful work

1 comments

simonw 593 days ago

I said GPT-4, not GPT-4o. I'm talking about a model that feels equivalent to the GPT-4 we were using in March of 2023.

link

int_19h 592 days ago

I remember using GPT-4 when it first dropped to get a feeling of its capabilities, and no, I wouldn't say that llama-3.3-70b is comparable.

At the end of the day, there's only so much you can cram into any given number of parameters, regardless of what any artificial benchmark says.

link

simonw 592 days ago

I envy your memory.

link

BoorishBears 592 days ago

You're free to intentionally miss their point, does them no good.

link