Generative AI is overrated, long live old-school AI | HN Mirror

Y	Hacker News new \| ask \| show \| jobs

	Generative AI is overrated, long live old-school AI (encord.com)
	227 points by Buhljingo 1191 days ago

26 comments

version_five 1191 days ago

Seems like the person who wrote the blog works in "classical" deep learning. So do I, so here's the fairest take I can come up with: "AI" has for recent memory been a marketing term anyway. Deep learning and variations have had a good run at being what people mean when they refer to AI, probably overweighting towards big convolution based computer vision models.

Now, "AI" in people's minds means generative models.

That's it, it doesn't mean generative models are replacing CNNs, just like CNNs don't replace SVMs or regression or whatever. It's just that pop culture has fallen in love with something else.

JohnFen 1191 days ago

Spot on. I work with deep learning systems in industrial control, and generative models are simply ill-suited for this sort of work. Wrong tool for the job.

But neither the traditional nor generative models are "AI" in the sense that normal people think when they hear "AI".

nico 1191 days ago

To me what’s exciting about Chat/GPT type of tech, is that they can be the “coordinators” of other models.

Imagine asking an AI assistant to perform a certain industrial control task. The assistant, instead of executing the task “itself”, could figure out which model/system should perform the task and have it do it. Then even monitor the task and check it’s completion.

version_five 1191 days ago

This is just wrong.

Also, even if a LLM could do that, so could a shell script, without the risks involved in using "AI" for it, or for now the ridiculous external dependence that would involve.

I wonder if in 10 years people will be stuck debugging Rube-Goldberg machines composed of LLM api calls doing stuff that if-statements can do, probably cobbled together with actual if-statements

jrussino 1190 days ago

> I wonder if in 10 years people will be stuck debugging Rube-Goldberg machines composed of LLM api calls doing stuff that if-statements can do, probably cobbled together with actual if-statements

Sounds like an extension of https://en.wikipedia.org/wiki/Wirth%27s_law. How many times have I done some simple arithmetic by typing it into my browser's bar and checking out the google calculator results? When a generation ago I would have plugged it into a calculator on my desk (or done it in my head, for that matter...). I would be entirely unsurprised to hear that in another generation we're using monstrously complicated "AI" systems to perform tasks that could be done way more simply/efficiently just because it's convenient.

vidarh 1190 days ago

My son regularly uses Alexa as a calculator, and also asks Alexa all kinds of things without a thought as to whether the output triggers a simple pattern match and gets fed to a specialised process or triggers a web search or is processed some other way. It's all conversational anyway. So the day Amazon plugs an LLM into it, it's not a given he'll even notice the difference for some time.

querez 1190 days ago

It's not wrong. It's how modern systems operate. E.g. look at Google's SayCan (https://say-can.github.io/) which operates exactly like this (an LLM ordering a Robot around).

JohnFen 1191 days ago

> doing stuff that if-statements can do, probably cobbled together with actual if-statements

In other words, old-school expert systems.

baq 1190 days ago

With the limit of 25k words it might actually be reasonable to test out a prompt for an expert system… but I’d still leave reasoning to something else, for now. Z3, prolog or some forward chaining tool like clips, but have the LLM hallucinate some of the rules?

Hermitian909 1190 days ago

LLMs are already taking over these sorts of systems in industry.

There are lots of systems where you're taking some information about a user and making a best guess at what action the system should take. Even without a need for super high accuracy these rule systems can get surprisingly complex and adding in new possible decisions can be tricky to maintain. In LLM world you just maintain a collection of possible actions and let the LLM map user inputs to those.

nico 1191 days ago

Sure, maybe you can use a shell script, but now the AI assistant can write it based on your verbal/text description, and then the assistant can also run it for you after you’ve checked it.

What your are saying is: “why use the washing machine, if I my clothes are even cleaner when I wash them myself - I also spend less detergent and less water”.

You are free to keep doing your laundry by hand.

But I bet most people prefer the washing machine.

xwdv 1191 days ago

Spare me the shitty analogies. We write shell scripts because it’s cheap, fast, and the behavior is very predictable.

Like it or not, an AI’s behavior is a black box and can’t be “proven” to execute exactly the same every time for the scenarios you are targeting.

A shell script will do exactly what it has been written to do every time, unless tampered with. And if changes need to be made, it can be done quickly without need for retraining, god knows how long that would take for an AI to learn something new. God help you if you need to maintain “versions” of your AI, trained for different things.

Face it, AI are pointless and slow for certain classes of problems.

whatshisface 1190 days ago

I think you're fighting an uphill battle because of what you picked to defend here - shell scripts are very easy to write, and I have a hard time imagining a future where someone tells an LLM,

"Write me a shell script that runs run_control.py with the speed argument set to one hundred."

to get,

"./run_control.py --speed 100"

DonHopkins 1190 days ago

You're totally undermining your own argument by using "shell script" instead of "python script".

pstorm 1190 days ago

You are getting a surprising amount of backlash from this, but I think you are right. There may be better tools for the job, but general tools tend to win out as they get "good enough"

digitalsushi 1190 days ago

mediocre is acceptable for most things. i'd rather have 1000 free photos from my wedding than 32 perfect ones. i still ended up with more than 32 perfect ones.

calf 1190 days ago

The central question is that a controller is assumed to be specifiable and thus formally verifiable through model checking in principle.

With a neural network you have a black box and for example with ChatGPT it doesn't even have a specification. It turns the verification process upside down.

JohnFen 1191 days ago

I'm not sure how the likes of ChatGPT could accomplish that even in theory, but I won't say it's not possible at some point in the future. Gpt itself, perhaps, someday.

nico 1191 days ago

Already ChatSpot is doing it. Their system is essentially a ChatGPT-enhanced Hubspot management system using chatux.

ChatSpot can understand your commands and then perform actions in the system for you, for example add a lead, change their contact info, write a blog post, publish it, add an image…

Edit: but if you connected it with physical actions, it could control your house, maybe check your smart refrigerator, order food on Instacart, send you recipe, schedule the time to cook in your calendar, request an Uber to pick you up from work, invite someone over, play music…

There’s a discussion about this on another homepage thread here: https://news.ycombinator.com/item?id=35172362

JohnFen 1191 days ago

Ah, ok. I thought you were talking about something a bit more profound than that.

nostromo123 1190 days ago

None of this is "industrial control task", aka "real work".

IanCal 1191 days ago

You can just tell the models to and tell them what tools they have available and how to call out to them. Langchain supports this iirc.

burnished 1190 days ago

What do you imagine this would do that existing automation does not?

nico 1190 days ago

Perform tasks that humans do now, but at scale, automatically.

We are going to be able to automate everything and anything with the proper feedback loops.

For example, you could have an app that writes itself, deploys itself, tests itself, receives feedback, updates itself based on the feedback, writes additional tests, does CI/CD.

At that point you will be just creating and directing. Or you can choose whatever you actually want to execute.

And then if those same kind of processes are given access to physical tools, they could do all of our manufacturing, design and build their own machines and infrastructure.

We could essentially collaborate with our systems in the most amazingly seamless way.

njarboe 1190 days ago

The term "AI" was corrupted as described. People now use the term "artificial general intelligence" (AGI) to refer to what used to be called AI.

JohnFen 1190 days ago

I was talking about what the average person thinks when they hear "AI". The average person has never even heard the term "AGI".

fakedang 1191 days ago

I'm curious about your work, because I worked on something similar during my grad school. What kind of applications in industry do you use deep learning systems for? Process control?

JohnFen 1191 days ago

Yes, process control. It's used in coordination with vision systems to analyze work pieces, determine the best way of processing them, and direct other machinery how to do that processing.

fakedang 1190 days ago

That's cool. If you don't mind me asking, would you have any shallow level stuff that I could read on about this? Even a website or a blog post would be great.

In my grad school, we were working on something similar - using computer vision to analyze reactor flows to then change process variables. The results would be fed back into the system for RL. Too bad the project sorta froze after I graduated.

JohnFen 1190 days ago

Your grad school project sounds very similar, yes, although we work with discrete objects rather than fluids. Fluid dynamics is much, much more complicated.

We actually use more than one neural network for this. The software is designed so the NN component is a plugin. The reason we do this is because some types of neural nets work better for some tasks than others.

Most (but not all) of our nets are convolutional.

Since you've already done some work with this sort of thing, I'm unsure about what level of overview would be of value to you, but this looks reasonable for a technically competent person who is new to the topic:

https://towardsdatascience.com/a-comprehensive-guide-to-conv...

As with all neural nets, the "secret sauce" isn't the code, it's the training.

sterlind 1191 days ago

do people actually use SVMs anymore?

like, regression, sure - because it's a tool to measure how well a hypothesis (polynomial function) matches the data (points.) and CNNs are still foundational in computer vision. but the first and last time I heard of SVMs was in college, by professors who were weirdly dismissive of these newfangled deep neural networks, and enamored by the "kernel trick."

but aren't SVMs basically souped up regression models? are they used in anything ML-esque, i.e. besides validating a hypothesis about the behavior of a system?

superdisk 1190 days ago

> but the first and last time I heard of SVMs was in college, by professors who were weirdly dismissive of these newfangled deep neural networks, and enamored by the "kernel trick."

LOL. Exact same experience in my college courses. Glad to know it's universal.

teruakohatu 1190 days ago

> do people actually use SVMs anymore?

Yes they are. They allow for non-linear decision boundaries and more dimensions than rows of data, which for many other ML methods is a problem.

Linear regression, logistic regression, SVM and CART decision trees are all still very popular in the real world where data is hard to come by.

jacksnipe 1191 days ago

We loved them in medical testing. Very explainable models.

rkhacker 1190 days ago

The Generative AI is the AI for the masses. While people were getting overhyped with all the possibilities and promises of AI and deep learning etc. it is for the first time that they can also tinker and get surprised by its results. People feel creative interacting with it.

gautamdivgi 1191 days ago

Isn’t most of the mathematics of AI old, as in really old?

Regression, both linear and logistic are from the mid 1800s to early 1900s. Neural networks, at least the basics are from around 1950.

What has really changed is the engineering, the data volume and the number of fields we can apply the mathematics to. The math itself (or what is the basis of AI) is really old.

sterlind 1190 days ago

backpropagation didn't get solved until the '80s, weirdly. before then people were using genetic algorithms to train neural networks.

and it was only in the last decade that the vanishing gradients problem was tamed.

my impression is that ML researchers were stumbling along in the mathematical dark, until they hit a combination (deep neural nets trained via stochastic gradient descent with ReLU activation) that worked like magic and ended the AI winter.

version_five 1190 days ago

Right, and the practice of neural networks has significantly overshot the mathematical theory. Most of the aspects we know work and result in good models have poorly understood theoretical underpinnings. The whole overparamiterized thing for example, or generalization generally. There's a lot that "just works" but we don't know why, thus the stumbling around and landing on stuff that works

cma 1190 days ago

> and it was only in the last decade that the vanishing gradients problem was tamed.

One of the big pieces was Schmidhuber's lab's highway nets, done ~30 years ago, but just didn't land until a more limited version was rediscovered.

fnordpiglet 1191 days ago

AI has been marketing term since the day it was coined. It means literally nothing, which means it can mean anything.

burbankio 1191 days ago

As the old joke goes, "AI" is anything that doesn't work yet.

Once an "AI" system becomes reliable, we quickly take it for granted and it no longer seems impressive or interesting. It's just a database. Or an image classifier. Or a chatbot.

Tanjreeve 1190 days ago

I'd argue a database is possibly as far from an AI as you get. Indexing and data structures and storage systems that go into them are very deterministic data structures you can model out and roughly know the behaviour of before writing a single line of code. Image classifiers and Chatbots you don't know what you're getting out till you train it and deploy it.

fnordpiglet 1191 days ago

Magic is just science we don’t understand yet.

Jensson 1191 days ago

Science is just magic we do understand is a cooler take.

fnordpiglet 1190 days ago

Yes that’s the one I use for my daughter ;-)

kmeisthax 1190 days ago

People calling neural-net classifiers "old-school" AI confused me. For a second I thought they were talking about the really old "expert systems" with everything being a pile of hard-coded rules.

01100011 1190 days ago

It still feels like there's a place for these rule based systems(Prolog?) to at least place some constraints on the output of non-deterministic, generative AI. If nothing else, have a generative AI generate the ruleset so you have some explicit rules you can audit from time to time.

theLiminator 1190 days ago

Yeah, i think one potential way to use blackbox ai in newer systems is having guardrails that are validated as safe (but perhaps non-optimal) and ensuring that the ai takes action within that sample space. Obviously this is hard problem, but might open the doors for policies (in self-driving cars, for example) to be entirely ai driven.

earthboundkid 1190 days ago

Obviously the solution is to get the LLM to output Prolog. Give it positive feedback if the Prolog compiles. :-)

01100011 1190 days ago

A friend of mine was just telling me how he asked GPT-3 to write a simple program in Prolog and it seemed to get it right. He didn't try compiling it, but he has enough experience w/ Prolog to say that it was more or less correct.

I'm pretty cynical on LLMs(i.e. they're not intelligent and won't take all our jobs soon), but am coming around on their importance and capabilities.

seydor 1191 days ago

I m not sure it's overrated, but the concerns are very real.

We love the model because it speaks our language as if it's "one of us", but this may be deceiving, and the complete lack of model for truth is disturbing. Making silly poems is fun but the real uses are in medicine and biology, fields that are so complex that they are probably impenetrable to the human mind. Can Reinforcement learning alone create a model for the truth? The Transformer does not seem to have one, it only works with syntax and referencing. How much % of truthfulness can we achieve, and is it good enough for scientific applications? If a blocker is found in the interface between the model and reality, it will be a huge disappointment

nico 1191 days ago

> model for the truth?

Without sensing/experiencing the world, there is no truth.

The only truth we can ever truly know, is the present moment.

Even our memories of things that we “know” that happened, we perceive them in the now.

Language doesn’t have a truth. You can make up anything you want with language.

So the only “truth” you could teach an LLM, is your own description of it. But these LLMs are trained on thousands or even million different versions of “truth”. Which is the correct one?

visarga 1190 days ago

There is a paper showing you can infer when the model is telling the truth by finding a direction in activation space that satisfies logical consistency properties, such as that a statement and its negation have opposite truth values. Apparently we can detect even when the model is being deceitful.

https://arxiv.org/abs/2212.03827

Another approach - a model can learn the distribution - is this fact known or not in the training set, how many times does it appear, is the distribution unimodal (agreement) or multi-modal (disagreement or just high variance). Knowing this a model can adjust its responses accordingly, for example by presenting multiple possibilities or avoiding to hallucinate when there is no information.

stormfather 1191 days ago

I think for practical purposes you could hold that text from wikipedia or scientific papers if true, for example. The issue I think OP is referring to is if a LLM can refer back to these axiomatically true sources to ground and justify its outputs like a human would.

nico 1191 days ago

Well in that case, maybe the debate is: do we want that? Why?

valine 1190 days ago

If you can trust the model is at least as accurate as wikipedia then it becomes a drop in replacement for every task you do that requires wikipedia.

There are a whole range of tasks that can’t be done today with an LLM because of the hallucination issues. You can’t rely on the information it gives you when writing a research paper, for example.

Barrin92 1190 days ago

For starters because one of the first products people decided to use these models for is a search engine, and I don't think it is a stretch to argue that search engines should have a positive relationship, rather than indifference, towards facts and the truth.

PartiallyTyped 1190 days ago

You can make up any reality that you want, just consume these first and don’t ask me where I found them.

In all seriousness though, what you are asking is whether an objective reality exists which is not a settled debate. There is also the whole solipsism thing though many disregard as a valid view of the world because it can be used to justify anything and is not a particularly interesting position.

Of course there is also the whole local realism thing with QM and of course the whole relativity thing and time flowing at different speeds destroying a universal “now”.

Then there is the whole issue with our senses being fallible and our brains hallucinating reality in a manner that is as confident as GPT3.5 is when making up facts.

In fact, it’s all just information and information doesn’t need a medium.

glitchc 1190 days ago

Our senses lie to us all the time. What we perceive may have strong to almost no correlation to reality. Can you see in the ultraviolet? No human can. Flowers look completely different. Same goes for sounds and smells.

seydor 1190 days ago

It can be exact and self-consistent, you can teach the rules of mathematics . There are some things that are provably unprovable but thats a known fact.

nico 1190 days ago

You can still express contradiction in math.

The rules don’t determine the interpretation.

An LLM will pretty much always respect the rules of language, but it can use them to tell you completely fake stuff.

seydor 1190 days ago

math is language

visarga 1190 days ago

In exact domains you can often validate the model with numerical simulations, or use the simulations for reinforcement learning or evolution. The model can learn from outcomes, not only from humans. In biology it is necessary to validate experimentally, like any other drug or procedure.

aaroninsf 1191 days ago

I am not so sure,

there seems to be accumulating evidence that "finding the optimal solutions" means (requires) building a world model. Whether it's consistent with ground truth probably depends on what you mean by ground truth.

Given the hypothesis that the optimal solution for deep learning presented with a given training set, is to represent (simulate) the formal systemic relationships that generated that set, by "modeling" such relationships (or discovering non-lossy optimized simplifications),

I believe an implicit corollary, that the fidelity of simulation is only bounded by the information in the original data.

Prediction: a big enough network, well enough trained, is capable of simulating with arbitrary fidelity, an arbitrarily complex system, to the point that lack of fidelity hits a noise floor.

The testable bit of interest being whether such simulations predict novel states and outcomes (real world behavior) well enough.

I don't see why they shouldn't, but the X-factor would seem to be the resolution and comprehensiveness of our training data.

I can imagine toy domains like SHRDLU which are simple enough that we should be able to build large models well enough already to "model" them and tease this sort of speculation experimentally.

I hope (assume) this is already being done...

JohnFen 1191 days ago

> there seems to be accumulating evidence that "finding the optimal solutions" means (requires) building a world model.

Was this ever in doubt? This has been the case forever (even before "AI"), and I thought it was well-established. The fidelity of the model is the core problem. What "AI" is really providing is a shortcut that allows the creation of better models.

But no model can ever be perfect, because the value of them is that they're an abstraction. As the old truism goes, a perfect map of a terrain would necessarily be indistinguishable from the actual terrain.

ChatGTP 1191 days ago

But no model can ever be perfect, because the value of them is that they're an abstraction. As the old truism goes, a perfect map of a terrain would necessarily be indistinguishable from the actual terrain.

Not sure why but I find this incredibly insightful…

nico 1186 days ago

> Prediction: a big enough network, well enough trained, is capable of simulating with arbitrary fidelity, an arbitrarily complex system, to the point that lack of fidelity hits a noise floor.

That is a pretty good description of human brains/bodies. You could also say that quantum physics is where our noise floor might be.

IIAOPSW 1190 days ago

Here's an alternative to a model for truth. There is no truth, only power. Suppose we completely abandon logical semantics and instead focused on social semantics. Instead of the usual boolean True/False variables and logic relations, we'll have people valued variables and like/dislike relations. I system entirely for reasoning about the amount of pull and persuasion is present without ever circuiting down to any ground truth reasons. In other words, a bullshit reasoning system. Can ordinary truth reasoning be jerryrigged out of this system?

seydor 1190 days ago

Yes, it s called empiricism

IIAOPSW 1190 days ago

This was rhetorical. My point was that a system or model which cares about something other than the truth can, upon reaching a certain level of sophistication, be able to handle reasoning about truth. Eg, an AI that cares entirely about approval for what it says rather than the actuality of what it says could still end up reasoning about truth, given that truth is most heavily correlated with approval. I reject the premise that there has to be an a priori truth model under the hood.

earthboundkid 1190 days ago

RLHF is basically just applying social power to the machine. It’s used for good (ChatGPT won’t help you spread Nazi memes) and hegemony (ChatGPT won’t help you overthrow capitalism).

draxil 1191 days ago

We are all struck with the novelty of generative AI, it needs time to settle. People will throw the universe at the wall and see what really sticks.

To my mind generative AI is great at finding needles in the haystack of stuff we already know. Of course it just as often gives you a fake needle right now, just to see if you notice.

On the other hand "traditional"/predictive AI is often better at the things we don't already know or understand.

GuB-42 1191 days ago

Is there a fundamental difference?

I mean, the only thing GPT does is predict the next word, which makes it not so different from a compression algorithm. And diffusion models (the image generating stuff) are essentially fancy denoisers.

Depending on how you assemble the big building blocks, you get generation or you get prediction.

baq 1190 days ago

GPT-3.5 is not a Markov chain, this is trivially true. While ‘predicts the next word’ is true, the mechanism of it is of interest and that is most certainly not trivial.

sweezyjeezy 1190 days ago

Depends how far you take the word 'fundamental', on the one hand yeah most DL systems are trying to predict something, and they generally have some concept of compression built in. But in terms of the steps to curate a dataset, train, test, iterate and actually use the model for a given end goal - they are pretty fundamentally different.

sharemywin 1190 days ago

I think the thing is though in Large multi models you give it all the data and test it against everything. And it generally does better across most of the benchmarks.

sweezyjeezy 1190 days ago

That depends entirely on the use-case - for example if you wanted to build an AI to operate a self-driving car, just training on unlabelled data scraped from the internet is only going to get you so far. It doesn't learn how to do EVERYTHING (not yet at least).

DeathArrow 1191 days ago

>investors have become only interested in companies building generative AI, relegating those working on predictive models to “old school” AI.

If that is the definition of old school AI, I wonder how symbolic AI should be named.

TuringTest 1191 days ago

> If that is the definition of old school AI

It is not. Symbolic, deductive reasoning engines have the same claim to being old-school AI as predictive statistic models.

snapcaster 1191 days ago

how about "useless with no successes of note" AI?

qorrect 1191 days ago

What ? We all use it everyday, it's just that as soon as the problem was solved with 'old AI', everyone forgot it was an AI problem.

TuringTest 1191 days ago

I hope you've never used the power grid or parcel shipping, as those are heavily optimized using symbolic AI.

efitz 1190 days ago

I think that 100% of the actually useful use cases for generative AI could be described in two words: “supervised autocomplete”.

orangecat 1190 days ago

That's not wrong, but an ideal autocompleter is a near-omniscient superintelligence. "The optimal approach to curing Alzheimer's is ______". "The proof of the Riemann hypothesis is as follows: ______". "The best way for me to improve my life is _______".

kneebonian 1190 days ago

I think the big difference is just being an Autocompleter is less concerned with generating something that is truthful, as in reflects the real world as we understand it described by physics, vs simply spitting out something that sounds good.

Although we do have a litmus test in asking it "What is the meaning of life the universe and everything?"

gweinberg 1190 days ago

Yes, exactly. An autocompleter is saying what the next words probably would be, not what it should be. It's like a chess program that tries to find the most likely move that a huan would make in the position rather than the best move.

efitz 1190 days ago

That’s why I said “supervised” - in other words, someone competent in the domain and context is examining the output and correcting or discarding as necessary before use.

“Unsupervised generative AI” is useless IMO.

goldenkey 1191 days ago

When the generative model is autoregressive (autocomplete), it can easily be used as a predictor. All of the state of the art language models are tested against multiple choice exams and other types of prediction tasks. In fact, it's how they are trained...masking - https://www.microsoft.com/en-us/research/blog/mpnet-combines...

For example: "Multiple-choice questions in 57 subjects (professional & academic)" - https://openai.com/research/gpt-4

croes 1191 days ago

Being good at standardized tests isn't really a good measure.

What happens with completely new questions from totally different subject. The generative model will produce nonsense.

k8si 1191 days ago

For GPT4: "Pricing is $0.03 per 1,000 “prompt” tokens (about 750 words) and $0.06 per 1,000 “completion” tokens (again, about 750 words)."

Meanwhile, there are off-shelf models that you can train very efficiently, on relevant data, privately, and you can run these on your own infrastructure.

Yes, GPT4 is probably great at all the benchmark tasks, but models have been great at all the open benchmark tasks for a long time. That's why they have to keep making harder tasks.

Depending on what you actually want to do with LMs, GPT4 might lose to a BERTish model in a cost-benefit analysis--especially given that (in my experience), the hard part of ML is still getting data/QA/infrastructure aligned with whatever it is you want to do with the ML. (At least at larger companies, maybe it's different at startups.)

potatoman22 1190 days ago

This is true, but the humans to develop the non-LLM solution are expensive and the OpenAI API is easy.

jedberg 1190 days ago

The real innovation will come one someone uses a Generative AI to make something, and then use a predictive AI to rate it's accuracy, making it go again until it passes the predictive AI.

Basically a form of adversarial training/generation.

ChikkaChiChi 1190 days ago

Bilateral "thinking" makes sense, and you can even feed generative AI back into itself for simple error correction.

I believe that we'll see the most success/accuracy once you have generative AI compare itself to itself, monitored by a GAN, which then spits out it's answer while retaining some knowledge as to how it came to the conclusion. A tricameral mind.

arrow7000 1190 days ago

Isn't this exactly how GANs work already?

jedberg 1190 days ago

Yes. But from I've seen no one has applied it to the latest Generative AIs.

dereg 1190 days ago

I’m pretty sure Anthropic’s Claude is doing that.

https://scale.com/blog/chatgpt-vs-claude

arrow7000 1190 days ago

Maybe an adversarial approach was used in training these models in the first place?

sharemywin 1190 days ago

It was they were' trained using reinforcement learning with human feedback to create the critic.

jedberg 1190 days ago

I hadn't thought about human feedback being an adversarial system, but I guess that makes sense, since it's basically a classifier saying "you got this wrong".

whiplash451 1191 days ago

The author might be missing the fact that generative models can be used for "old-school" prediction tasks, with quite outstanding results.

Their power does not only lie in their ability to _generate_ new data, but to _model_ existing data.

jasonjmcghee 1191 days ago

The biggest issue with using them in this way is how alien the failure modes are.

Interpretable models with transparent loss functions are easy to grok.

How LLMs might fail on a classic task is (afaict right now) difficult to predict.

whiplash451 1191 days ago

What is not transparent in the cross-entropy loss used in a large number of deep nets?

jasonjmcghee 1191 days ago

I think there was a breakdown in communication here.

If I train a classic deep net as a classifier and there are 5 possible classes, it will only ever output those 5 classes (unless there's a bug).

With ChatGPT, for example, it could theoretically decide to introduce a 6th class - what I would call an alien failure mode, even if you explicitly told it not to.

I think formally / provably constraining the output of LLM APIs will help mitigate these issues, rather than needing to use an embedding API / use the LLM as a featurizer and train another model on top of it.

calf 1190 days ago

Formal proof is problematic because English has no formal specification. Some people are working on this, it's a nascent area bringing formal methods (model checking) to neural network models of computation. But it's an interesting fundamental issue that arises there, if you can't even specify the design intentions then how do you prove anything about it.

redox99 1190 days ago

I wonder how good multimodal GPT4 is at ImageNet.

(You give it the image and prompt it with the 1000 classes and ask it which one the image belongs to).

I'm surprised ClosedAI didn't include this kind of benchmark. I guess it doesn't do too well?

sharemywin 1190 days ago

Here's something on Clip

https://www.pinecone.io/learn/zero-shot-image-classification...

kyleyeats 1191 days ago

I'm working on an old-school AI personal project right now. I don't know how long that lasts. The generative stuff is more and more tempting. It rewards the horrible micromanager in me like nothing else.

EGreg 1191 days ago

Yes! Just like HN is anti blockchain but super pro AI. It seems most applications of generative AI at scale will havd a huge negative for society, far worse than anything blockchain could have brought about.

peter_retief 1191 days ago

"So has generative AI been overhyped? Not exactly. Having generative models capable of delivering value is an exciting development. For the first time, people can interact with AI systems that don’t just automate but create an activity of which only humans were previously capable."

Good answer but I feel that most users/people do not understand the difference between generative and predictive machine learning and that will probably cause unpredictable failures and false flags. So yes it has been overhyped in my opinion

kenjackson 1191 days ago

IMO, it has been underhyped. We're seeing things with LLMs that a decade ago I'd say was multiple decades out, if not more.

We're just years into generative approaches. And I think we'll more combinations of methods used in the future.

The goal of AI has never been to build an all knowing perfect system. It has also never been to replicate the way the human brain works. But its been to build an artificial system that can learn -- and AGI specifically to be able to give the appearance of human learning.

I feel like we've turned this corner where the question now is, "Can we build something that knows everything that has been documented and can also synthesize and infer all of that data at a level of a very smart human". The fact that this has become the new bar is IMO one of the biggest tech changes in history. Not the biggest, but up there.

beepbooptheory 1190 days ago

Trying to imagine this stuff being even more hyped and I just don't think its possible. People around here are practically ready to sell their first born child to OpenAI/Microsoft at this point.

PaulDavisThe1st 1190 days ago

> Can we build something that knows everything that has been documented and can also synthesize and infer all of that data at a level of a very smart human

The word "know" is doing some heavy lifting there, as is "synthesize" and "infer".

kenjackson 1190 days ago

By "know" I meant has access to. This is a very "database" sense of the word "know".

Now "infer" and "synthesize" I meant the standard human definition of "synthesize" and "infer". In my interactions with relatively bright people, they really expect ChatGPT to be able to synthesize text at the level of a very sharp HS/college student. They don't want simple regurgitation of a text or a middel school analysis -- they want/expect ChatGPT to analyze nuance, and pull in its vast database to make connections to things that maybe aren't apparent at first glance.

The bar has raised so high so quickly -- it's crazy.

peter_retief 1190 days ago

I am very excited about the possibilities of AI/ML but am concerned as to how it is been sold to the public.

Xelynega 1191 days ago

I think the issue is more with people marketing/talking about them as "AI". When I think AI I think of something like Skynet. I would assume something like Skynet would be good at chess, able to generate new text, and synthesize new images. I think when shown novel algorithms that can do those things and told by the people selling the algorithms that they are "AI", it's hard to disagree since they quack like an AI so it's easy to accept that these are the same "artificial intelligence" concept in our brains which we previously only had examples of from fiction.

Basically I think it's overhyped by the use of the term "AI" and how easy we are to accept it generally. Some aspect of them being generative models could have been the term used to market/describe them, but instead a much broader term is used.

phonebucket 1190 days ago

There is much more to generative models than building out language models and image models.

Generative models are about characterising probability distributions. If you ever predict more than just the average of something using data, then you are doing generative modelling.

The difference between generative modelling and predictive modelling is similar to the difference between stochastic modelling and deterministic modelling in the traditional applied mathematical sciences. Both have their place. Neither is overrated.

Grab the best tool for the job.

f0ld 1190 days ago

Or could it be possible that it was always going to end up like a black box it seems is it not? We will never truly understand the inner workings while it solves every problems that can be numbered by which it couldn't be solved with hardline algorithms previously. It's literally calling higher dimensional egrigores for answers or some blood magic Genie.

tolciho 1190 days ago

As stated by John McCarthy--"I invented [AI] because we had to do something when we were trying to get money for a summer study" (the Lighthill debate)--this article passes the AI sniff test, or "please remember us predictive AI folks when you go to dole out your money" as all that is solid melts into PR.

uoaei 1191 days ago

Generative methods per se are pretty sick and dope, and are still useful for many things beyond art generation.

glitchc 1190 days ago

I see and I hear:

"Don't be dazzled by AI computer vision's creative charm! Classical computer vision, though less flashy, remains crucial for solving real-world challenges and unleashing computer vision's true potential."

Meant for those in classical computer vision before ML ate the field.

potatoman22 1190 days ago

If by classical CV they mean image processing and machine vision, I would say that's still true.

pretendscholar 1191 days ago

I’m not sure I understand a definition of AI that doesn’t include the ability to generate things.

WoodenChair 1191 days ago

> I’m not sure I understand a definition of AI that doesn’t include the ability to generate things.

It depends how you define "generate." For example, is software that controls a robot arm generating anything? I guess it's generating the movements of the arm. But when people use the term "generative" with regards to machine learning models right now, they generally mean content—e.g. text or images for consumption.

yunwal 1190 days ago

Generative has a more technical meaning than that.

Generative AI is essentially the opposite of a classifier. You give it a prompt that could mean many different things, and it gives you one of those things. A robotic arm could use generative AI, because there are many different sets of electrical signals that would result in success for, say, catching a ball.

Classification is an example of a non-generative AI in that there is only 1 correct answer, but it still requires machine learning to acquire the classification function.

TuringTest 1191 days ago

You can use AI to validate things, i.e. to check that they conform to some specification.

You may twist the language to say that they are generating a list of validations and errors, but even then it's definitely a different use case than merely creating new items.

croes 1191 days ago

The point is that AI is more than just generating more of the same data it was trained on.

wslh 1190 days ago

I would add that there are logic deductive and constraint systems that are more classical and work in some areas. It is not about a single method but we should he aware that AI is a superset of what we see.

ElijahLynn 1190 days ago

This article could be improved by starting off stating what some examples of Predictive AI is, as they did with Generative AI.

patrulek 1190 days ago

Old-school, huh. The skynet is closer than we think i guess.

kulkarniankita 1190 days ago

I love it but also waiting for the hype to settle down.

nathias 1191 days ago

after the era of low hanging fruits of generative AI will be over I'm sure there will be a return to other approaches

all2 1191 days ago

From TFA:

    TLDR; Don't be dazzled by generative AI's creative charm! Predictive AI, though less flashy, remains crucial for solving real-world challenges and unleashing AI's true potential. By merging the powers of both AI types and closing the prototype-to-production gap, we'll accelerate the AI revolution and transform our world. Keep an eye on both these AI stars to witness the future unfold.