Hacker News new | ask | show | jobs
by chron83602 744 days ago
> And I am not talking about predicting the future, but more predicting the next action to take based on current state, sensor data in a more seamless way.

Like self-driving cars?

Self-driving cars is an engineering problem, let alone an AI problem, and we still cannot solve it despite trillion dollar economic incentives.

Just putting together some LLMs on a fuckton of data does not work. Tesla tried that, and failed.

3 comments

Tesla has been using sparse data to train their models, because they needed to prioritize fast on device inference.

Completely different solution applied to a completely different problem with completely different risk and quality tolerances with completely different mitigations.

Self-driving cars don't use LLMs so the comparison is invalid, it's a different technology.
The point is to try and see if LLM's wide general knowledge can have advantage in something like sensory data + action learning as well. Current self driving models don't have that.
Actions typically consist of a series of small steps.

Given that LLMs are inaccurate around 5-10% of the time each step will compound the error rate until you are better off flipping a coin.

I don't understand this stretch logic. It absolutely depends on the type of problem where they are inaccurate, how well trained they are in it, there is no way you can extrapolate like this.

You can ask them to do math equation which takes steps and if they are trained in that for certain problems they are accurate near 100 percent of the time.

Like ask gpt-4o to solve different variations of

"""What is the answer to 2x + 7 = 31?"""

If the numbers are of similar magnitude and simplicity, it will follow the same steps and be right 99%+ times, and I'm only not saying 100%, because I haven't tried it enough, but I don't see it being wrong.

For example """What is the answer to 2x + 4 = -6?"""

Just run a test yourself. Do random integers within 0 - 20, it will definitely not be incorrect 5% - 10% time. It will be correct 99%+ time.

Where is this number 5% - 10% even coming from? You could also keep asking it "What is the capital of France?" and it's going to be right 99%+ of the time.

You are conflating asking a single question to ChatGPT versus AI agents which typically need to interact with an LLM multiple times.

And the 5-10% is on average and gets significantly worse as you expand the context length which is also something you want for an agent.

It depends on the problem right. It would have 0 accuracy one some problems and near 100 percent on others.

Based on what you are attempting to do you could get any average in the end.