| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by distant_hat 2201 days ago
	People don't understand what exponential improvement means. GPT-3 is a 175B with B parameter model. Another few rounds of doubling and we could be seeing models spit out short stories and novellas.

2 comments

derivativethrow 2201 days ago

Have you read the GPT-3 paper? The authors are pretty forthright about how the GPT techniques likely won't scale much beyond 175B parameters.

link

ur-whale 2201 days ago

Yet, the darn thing still can't reason.

link

visarga 2201 days ago

They "can't reason" but they can solve symbolic integration and differential equations (https://arxiv.org/pdf/2006.06462.pdf), beat us at all board games, and more recently get to be almost impossible to tell apart from human writing (GPT3).

link

jbay808 2201 days ago

Five years ago we would have said "the darn thing still can't write a cohesive paragraph".

link

wnoise 2201 days ago

That's still true. It can write a paragraph that's usually grammatical, and can stay on topic, but it's missing things like facts, or even the ability to remember which side of an argument it's taken previously.

link

jbay808 2200 days ago

I think over the course of several paragraphs that's true, but within one it tends to be pretty good.

link

distant_hat 2200 days ago

People look at the present and feel like that's all that's there. You don't look at a 2-year old and say he barely makes any intelligible sounds yet.

link

Enginerrrd 2201 days ago

To be fair, it wasn't trained to.

To be really fair, I think you need to more precisely define what you mean by "reason". It absolutely CAN reason by some measures.

link