HN Mirror


djb_hackernews 120 days ago

You have a misunderstanding of what LLMs are good at.

4 comments

cap11235 120 days ago

Poster wants it to play Jeopardy, not process text.

paganel 120 days ago

Not sure if you're correct, as the market is betting trillions of dollars on these LLMs, hoping that they'll be close to what the OP had expected to happen in this case.

raincole 120 days ago

The market didn't throw trillions of dollars at developing Llama 3 8B.

What GP expected to happen did happen around late 2024 to early 2025, when LLM frontends got web search features. It's old tech now.

paganel 120 days ago

The GP’s point was about LLMs generally, no matter the interface. I agree that this particular model is (relatively speaking) ancient in the AI world, but go back 3 or 4 years and this (pretty complex “reasoning” at almost instant speed) would have seemed like something out of a science-fiction book.

IshKebab 120 days ago

I don't think he does. Larger models are definitely better at not hallucinating. Enough that they are good at answering questions on popular topics.

Smaller models, not so much.

kleiba 120 days ago

Care to enlighten me?

vntok 120 days ago

Don't ask a small LLM for precise, fine-grained factual details.

Alternatively, ask yourself how plausible it sounds that all the facts in the world could be compressed into 8k parameters while remaining intact and fine-grained. If your answer is that it sounds pretty impossible... well, it is.

kleiba 120 days ago

Did you see the part in my original post where it said "Not unexpected for an 8k model"?

vntok 119 days ago

Oh I saw it, you still have a fundamentally flawed comprehension of LLMs.

The size of the model is not the deciding factor, as tiny models can use the Internet to fetch factual information.

But you think they are accurate repositories of knowledge, even though it's physically impossible unless lossless infinite compression algorithms exist (they don't, can't and won't).

kleiba 119 days ago

I think you're overestimating your ability to assess what others think or comprehend.
