| HN Mirror



	by otabdeveloper4 62 days ago
	Fundamentally they're the same technology with the same exact algorithms under the hood; only the post-training alignment differs. That is, the difference you see is either a placebo effect or you getting lucky and aligning better with the model's post-training bias.

1 comments

paulluuk 61 days ago

Sorry, I was not specific enough. I did not mean that open source itself is not enough; I meant that an open source model that can actually run locally on my machine is not enough. A 32B model cannot compete with a 250B+ state-of-the-art model; at least that's my experience, and it seems to be the experience of many others as well.


eloisant 61 days ago

Yes, they're not as powerful. That means you need to feed them smaller tasks and rely more on plan mode.


otabdeveloper4 61 days ago

Saying it "cannot compete" is like saying that a Kia cannot compete with a BMW.

Technically true in some sense, but fundamentally the two are the same exact thing and it's highly unlikely you have a task that actually requires a BMW.
