by midlightdenight 1106 days ago
Impressive work, but I think the title is misleading. Saying it is “near GPT-4” tends to imply that it outperforms ChatGPT (3.5). It does outperform it on a handful of tasks, but overall it is slightly worse.

That aside, I think this is really cool, and hopefully we keep seeing this kind of improvement in smaller models.

I’m also curious whether we know how many parameters the current ChatGPT 3.5 model has. The API is really cheap, which makes me think it has fewer than the 175B in the larger GPT-3.

1 comment

Oh sorry, I didn’t look properly then. I thought it outperformed GPT-3.5 consistently. My bad. Thanks for the correction.