Hacker News new | ask | show | jobs
by otabdeveloper4 62 days ago
Fundamentally they're the same technology with the same exact algorithms under the hood; only the post-training alignment differs.

That is, the difference you see is either placebo effect or you being lucky and better aligning with model post-training bias.

1 comments

Sorry, I was not specific enough. I did not mean that open source itself is not enough, I meant that an open source model that can actually run locally on my machine is not enough. a 32B model can not compete with a 250B+ state of the art model, at least that's my experience and seems to be the experience of many others as well.
Yes they're not as powerful, that means you need to feed them smaller tasks and rely more on plan mode.
Saying it "cannot compete" is like saying that a Kia cannot compete with a BMW.

Technically true in some sense, but fundamentally the two are the same exact thing and it's highly unlikely you have a task that actually requires a BMW.