


	by wolttam 34 days ago
	In 2023 GPT-4 was allegedly 1.8T parameters. In 2026 we have ~100x smaller models (10-20B) that handily outperform it, and can indeed run on a laptop.
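A rough back-of-the-envelope check of the sizes mentioned above, as a minimal sketch. It assumes the alleged (unconfirmed) 1.8T figure, counts weights only (no KV cache or activations), and uses hypothetical helper names `size_ratio` and `weight_memory_gb`:

```python
# Back-of-the-envelope arithmetic only: weights, no KV cache or activations.
GPT4_PARAMS = 1.8e12  # alleged figure quoted in the post, not a confirmed number


def size_ratio(small_params: float, big_params: float = GPT4_PARAMS) -> float:
    """How many times smaller the small model is, by parameter count."""
    return big_params / small_params


def weight_memory_gb(params: float, bits_per_param: int) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    for params in (10e9, 20e9):
        print(
            f"{params / 1e9:.0f}B params: {size_ratio(params):.0f}x smaller, "
            f"{weight_memory_gb(params, 16):.0f} GB at 16-bit, "
            f"{weight_memory_gb(params, 4):.0f} GB at 4-bit"
        )
```

Under these assumptions a 10-20B model is roughly 90-180x smaller than the alleged GPT-4 size, and a 20B model needs about 10 GB for its weights at 4-bit, which is consistent with the "runs on a laptop" claim.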

2 comments

WanderPanda 34 days ago

It highly depends on the task. For math and coding, sure. But for knowledge tasks GPT-4 is way better than even SOTA ~100B models. For my knowledge test cases the lines get blurry at >400B.


rectang 34 days ago

How does "outperform" translate to the propensity of an LLM to hallucinate?


operatingthetan 34 days ago

There seems to be a mass delusion about how capable SOTA models actually are. That's my only explanation for how poorly I find them performing on basic knowledge tasks compared to how others describe their prowess.


rectang 34 days ago

I understand you to be implying that I shouldn't trust my perception that there's a meaningful difference in how much different models hallucinate. I will take that under advisement, but I am still interested in the answer to my original question.


operatingthetan 34 days ago

>I understand you to be implying that I shouldn't trust my perception that there's a meaningful difference in how much different models hallucinate.

Nope. Also I'm not GP.
