Hacker News new | ask | show | jobs
by jonwinstanley 1132 days ago
Sure, anyone that uses ChatGPT knows it's currently not perfect.

But there's a presumption that these tools are going to keep improving over time. Which is presumably why the AI hype is so strong.

Whether AI ends up displacing people from their jobs in the long term, well, that's impossible to know. Just because no technological advancement has ever done that in the past doesn't mean it will never happen in the future.

2 comments

As long as the accuracy of an LLM’s output is unknowable, there’s going to be a pretty hard limit on the kinds of jobs these tools can “replace”. And its not at all clear that this fundamental problem can be fixed at all with the current approach.
A tool doesn't have to obviate a worker's contributions 1:1 to replace them.

If one person can now do the work of 1.5 people, then the number of people needed for a profession shrinks, all else equal. For example, a professional translator may be able to do 2x the work by leveraging LLM/other AI, even though you still need them to validate the results. If productivity doubles, then only half the people are required to meet current needs.

The mistake is in believing that LLM's output should be deterministic to be useful.

Human output is not deterministic.

Fields with text-heavy output are already being upended by this. Being able to summarize long legal briefs, identify contract problems, do classification of discovery documents, or even write first drafts of common legal forms is already upending the legal discipline.

Chat-based customer support agents are seeing 25% productivity improvements based on two-year-old models for new employees, according to a study published in NBER.

Things like BabyAGI and other sequential "do anything" tools appear to be close to useless now, and unfortunately that is what is catching a lot of hype on Twitter. But actual industry applications are much quieter (often NDA) and much more impactful.

It's not about being deterministic.

It's about the LLM itself having any way to determine whether what it says is true.

Not understanding why this is an issue for LLMs but not humans.

This is a simple commercial decision to make governed by three factors.

1. What is the cost of making an error?

2. What is the cost of the human doing the work?

3. What is the likelihood of the human making an error?

It's just evaluating how much more likely AI is to make an error than a human, by the cost of that error, set against the savings by using fewer humans.

Look at the legal profession. Sometimes the cost of an error is high, but usually it is not. There are already tons of little errors in contracts and discovery, and today they're all human. And people are very expensive. There is a giant swath of legal work that looks very attractive to automate at less than 100% accuracy.

Customer service: people offer poor customer service all the time, and usually the cost of that error is low. Human customer service isn't as expensive as legal work, but it's still relatively expensive. Very attractive to automate at less than 100% accuracy.

> Not understanding why this is an issue for LLMs but not humans.

Because humans have the capability to understand where their information comes from, and thus give enough meta-information to evaluate an accuracy rating, even if not all of them are good at it all the time.

I understand that there has been some effort to build this capability into LLMs, and that it works a little bit for some of them, but it is not something that most of them are fundamentally capable of.

I have met many a human that, despite having that capability, will just choose not to use it to the point of getting violently emotional to protect their ignorance and viewpoints.

Whereas, even though their capabilities are lacking, as someone who is disabled, I've found working with LLM to be much more effective. I've had the medical community gaslight me and actively use processes that make things worse. I've had the educational community restrict me from accessing education. I've had the legal community weaves webs of "yes the policy states that we must comply but that's actually a suggestion." Even when it comes to interacting with people socially, my experience has been one of violence and trauma.

So, in the face of everything it can't do, I'd much rather place my bets there than with humans. I myself am human. What human do you know would spend effort asserting that? One who is constantly dehumanized by society. And I think there is a lot of opportunity for disabled people to use this tech to be accepted as humans? Or to at least gaslight society back into thinking we are?

People won't be able to maintain "The Status Quo" and, frankly, I can't wait.

Humans can make mistakes and lie, and we've been able to deal with it by checking their work, giving feedback to help them improve, placing less trust in those who habitually lie, etc..

LLMs making mistakes and "hallucinating" can be dealt with in similar ways, and as this is an open area of research with lots of proposed solutions and probably many more in the years to come, we do/will have plenty of other ways to deal with it too.

It’s not the first time we’ve been here , either, with AI although t this time it’s a bit more in the public , ie retail sphere. There are people who will confidently tell you that LLM are the next transistor level invention, and people who will tell you it’s more incremental, like eg an electric pressure cooker - improvement in some ways over what came before, got lots of people using them, but not fundamental. I’m sure there is a better example.

Anyway, the truth is nobody actually knows at this point .