| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by notavalleyman 456 days ago

> A company that publishes libel and harmful content

Do you believe that a company who offers an llm to the public, could be said to have 'published' the generative output?

Llms are day dream machines - is it libel if I tell you that I had a dream about you, where you killed a guy? (Has HN just published a harmful lie about you?)

2 comments

ziddoap 456 days ago

>is it libel if I tell you that I had a dream about you, where you killed a guy?

This isn't even closely analogous. I don't know if you could come up with a more bad faith argument.

This was outputting a lie, presented as a fact, to anyone in the world that searched the name.

There is a difference in context (dream vs. fact), difference in scale, difference in expectation (machine outputting what is advertised as accurate information vs. random chatter on a forum where the expectation of accuracy is not a selling point), different methods of redress (chatter can correct you via comment, not so much with an LLM).

link

notavalleyman 456 days ago

> This was outputting a lie, presented as a fact, to anyone in the world that searched the name.

I think you are wrong.

From my understanding, the complainant opened a new chat window and typed "who is forename surname?"

The daydream machine then daydreamed some output text, as is it's function.

Likewise, you can go now to any llm and ask it a specific question like "what is the minimal cheese principle?" (Which I've just made up) And many will daydream a consistent answer for you. As is their function

link

ziddoap 456 days ago

>The daydream machine then daydreamed some output text, as is it's function.

These are not advertised as daydream machines.

They compete on how accurate they they are against various accuracy benchmarks.

The average person who uses them does so with the expectation of accurate results, you know, as they are advertised. Accuracy and speed are pretty much the entire business model.

link

notavalleyman 456 days ago

No, they generally do not compete on accuracy benchmarks afaik.

GitHub/openai/simple-evals is what I checked here, and no, openai do not compete on accuracy benchmarks as far as I can tell. So I'd be interested in seeing what led you to think that, and also what led you to earlier claim that anyone typing in the complainant's name saw the same hallucination.

link

ziddoap 456 days ago

>No, they generally do not compete on accuracy benchmarks afaik.

"Get Answers" is literally at the top of ChatGPTs landing page. You think the average person interprets that to mean "Get inaccurate answers"?

Google "AI benchmark" and almost every result is an assessment of the accuracy of various models. What do you think they compete on? How do you think they measure the improvement of one model to the next?

Here's OpenAI's "Optimizing LLM Accuracy" https://platform.openai.com/docs/guides/optimizing-llm-accur...

Pop this in Google and see the pages of results about accuracy: site:openai.com "accuracy". To claim that they don't optimize for accuracy confirms to me that you are not discussing this in good faith. Perhaps you are just trying to be contrarian or something, I don't know.

>and also what led you to earlier claim that anyone typing in the complainant's name saw the same hallucination.

Well, it says right in the article that different people received the same result.

Why are the goalposts moving? Actually, nevermind, I don't care to continue the conversation.

link

notavalleyman 456 days ago

I think if you take a few moments to read carefully.

You'll see that AI companies, including openai, are generally not competing on accuracy benchmarks.

For example, here are the benchmarks on which open ai seem to be trying to compete.

MMLU: Measuring Massive Multitask Language Understanding,

MATH: Measuring Mathematical Problem Solving With the MATH Dataset,

GPQA: A Graduate-Level Google-Proof Q&A Benchmark,

DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs,

MGSM: Multilingual Grade School Math Benchmark (MGSM), Language Models are Multilingual Chain-of-Thought Reasoners,

HumanEval: Evaluating Large Language Models Trained on Code,

link

mvdtnz 456 days ago

> Do you believe that a company who offers an llm to the public, could be said to have 'published' the generative output?

Of course they did. What other term could you possibly use for it, when one goes to a website and that website itself hands you content? The content is certainly not user-generated, it's coming from the website.

link

notavalleyman 456 days ago

Maybe a different way of phrasing it would be, if a website embedded a rng generator, and you see the random number "eight", then did the publisher publish the number eight, or did they publish a rng? In my opinion, it's the latter. Similarly if the rng generated the number 666, we wouldn't assume the website is making some kind of biblical commentary. We'd recognise that the rng produced a random generative output, similar to the op situation. So, to impugn the publisher of a random text generator based on the random content .... If chatgpt generates a murder threat, or pro-terorrism content, or otherwise shouts fire in a theater, do you believe openai as the publishers should face arrest?

link

mvdtnz 456 days ago

> do you believe openai as the publishers should face arrest?

Absolutely I do, yes. An LLM is not a random number generator. It is a tool built for the sole purpose of generating content.

link

notavalleyman 456 days ago

It's very shocking to me that you would reply in the affirmative.

I think you're saying that companies who host generative AI web services, ought to be legally liable if the ephemeral generative content is illegal.

In your mind, should AI companies try and engineer protection from this huge legal risk? It seems criminally insane for a company to host an AI if they're going to be legally liable for the ephemeral daydream content. You should be shorting goog, meta and msft at the very least, because I make their models generate illegal content every night before bed

link

mvdtnz 456 days ago

Do I think that companies that host and share illegal content should be held liable? Of course I do. How could you possibly feel any other way?

I'm not shorting anything because I'm not a gambler and my opinion on what should be illegal has no basis in what actually is illegal in USA, a country I have never set foot in.

link

Nullabillity 456 days ago

> It seems criminally insane for a company to host an AI

Yes?

link

lxgr 456 days ago

The purpose of a system is what it does.

If you publish a "random number generator" that consistently publishes defamatory statements about a person at a much higher frequency than statistically plausible, that's not a random number generator, it's a defamatory statement generator, no matter what you call it.

In court, your intentions might matter just as much as what the system does, but even there, the name you give it is pretty much irrelevant.

link

Nullabillity 456 days ago

The problem with the Alex Jones defense is that they can never quite seem to figure out whether they're just a silly random number generator, or The Revolutionary Future Of Work(tm) that we should be okay with investing trillions of dollars into training and operating.

link