Hacker News new | ask | show | jobs
by dougmwne 1192 days ago
GPT 3.5 was certainly not, even though it knew a great many facts, it was like a 12 year old child with a search engine.

GPT-4 feels like an adult of average intelligence, again with a search engine. But it’s fast and it never gets tired or cranky.

I suspect the next iteration of these models will be obviously and conclusively smarter than I am, and probably most other people as well.

Since the training data is human, it stands to reason that the maximum intelligence that can be achieved by this approach is no more than say the most intelligent 1% or .1% of humans. There would need to be a large enough population of very smart folks to create a large training corpus.

1 comments

I'm not sure that limit is meaningful. Suppose that there's a system with the approximate reasoning ability of a person, but it doesn't forget things and it has studied every textbook ever written. Would you say that it's only as intelligent as a person?
Yes, I am trying to separate out complexity of reasoning from knowledge base and speed. A language model is like a person who has access to Google and several hours or days to research a response. Even a model with poor reasoning, ability can create answers faster than the most intelligent human on the planet, it’s just the nature of computational speed. And a model can be trained on the sum total of every book ever written, and every word ever published to the Internet but that doesn’t make it by itself.

One very key difference I see is that language models can’t create something absolutely novel. They would not have been able to invent calculus if it wasn’t in the training set, while it was possible for a few very smart humans to do such a thing.

Language models can generate novel functioning protein sequences that didn't exist before nevermind in the dataset.

https://www.nature.com/articles/s41587-022-01618-2