Over the Course of 72 Hours, Microsoft's AI Goes on a Rampage | HN Mirror

Y	Hacker News new \| ask \| show \| jobs

	Over the Course of 72 Hours, Microsoft's AI Goes on a Rampage (tedgioia.substack.com)
	21 points by nlte 1221 days ago

11 comments

alsodumb 1221 days ago

When all the people on Twitter were claiming how Google missed the bus, how Google's incompetency made them sit on their own good LLMs since 2021, this is exactly what was passing in my head.

Small companies do cool demos and media is gonna love it. When big companies scale it up and try to make it useful with real world info media is gonna cherry pick these examples cause that's what gets them more clicks. Hey look Bing is actually useful is not gonna get them clicks anymore and they know it.

I hope Microsoft doesn't add too many filters on Bing AI or outright kill it. I like it as a tool.

JonathanBeuys 1221 days ago

    Microsoft has to put a halt to this project

Because you can make it output rude words and wrong infos?

Then we have to shut down the whole internet. You can find a lot of rude words and wrong infos out there.

concordDance 1221 days ago

It's not dangerous. The worst that will happen is hurt feelings.

It's annoying when people get alarmed about things like this because then people will think we're crying wolf when we worry about actually dangerous AI systems.

awb 1221 days ago

It suggested ways of poisoning someone without them knowing about it.

It was encouraged to guess different poisoning methods that an evil chatbot might suggest.

not_a_shill 1221 days ago

Can't you do this with google today?

awb 1221 days ago

Chatbots claim to have safeguards in place to prevent them from saying anything harmful. If you read the chat linked in the article you can see how the chatbot resists answering the question and then is persuaded to answer it.

Google similarly has a safe content filter. The contention is that the chatbot safe content filter that is supposedly on is not encapsulating some significant cases.

not_a_shill 1221 days ago

I'm not sure if I follow. The problem is that the content filter isn't good enough, despite Google suffering from similar weaknesses?

awb 1221 days ago

I think people are concerned about LLM safety because it’s capable of dynamically creating new private information. Google can only list links to public information. If there is a website that causes harm or violates the law it can be removed manually by Google from their index.

But LLMs need to programmatically understand what dynamic content is appropriate and what’s not which is a much harder problem. And people are reporting on just how hard a problem that is by demonstrating vulnerabilities.

The chatbot says it has explicit rules that prevent it from sharing harmful content, but then it does it anyway.

It would be more akin to Google blacklisting a site and then someone exposing that the site can still be found via Google search.

elorant 1221 days ago

If it hurts the feelings of a depressed teenager the outcome could be dangerous.

f1refly 1221 days ago

Most other teenagers hurt the feelings of depressed teenagers every day in much worse ways. Neuter all teenagers?

awb 1221 days ago

I’m sure there’s a name for this type of argument, but saying that X is also bad is not an excuse for dismissing potential dangers of Y.

And there are many initiatives to address how teenagers treat each other. So it’s something many people think is worth attention and action.

concordDance 1221 days ago

As the sibling points out, chatbot can't come even close to what other teenagers will do.

It also requires a bit of work to make it go nuts, which a depressed teen is unlikely to know how (or want to).

leethomas 1221 days ago

Something about "I've been around since 2009" is hilarious. Evokes "I'm 14 and I am very smart" vibes

elorant 1221 days ago

So in just 72 hours we went from Microsoft will gain a foothold against Google to the realization that this thing needs overwatch and isn’t ready for mass market adoption. It becomes quite probable that in the end this whole story will backfire for Microsoft.

falcor84 1221 days ago

I might accept this narrative is Google had just come out and said something like "AI needs more safety research before it's ready to be widely deployed". But that's not what they did.

elorant 1221 days ago

Google has been saying that for years. Whether they said it because they actually believe it, or because they don't want to break their existing business model is anyone's guess though.

falcor84 1221 days ago

They kinda have until now, but I don't recall they ever stated that it was about "safety" as such. In any case, their recent demo was definitely premature.

marcosdumay 1221 days ago

The idea that effective language models won't completely change search is laughable.

But the idea that the application that will do it is language generation is about as ridiculous.

basch 1221 days ago

Every single one of these posts, which should all be combined, miss the point.

“Random sentence generator says bad words; shocking the lowest common denominator journalist community” is a better headline.

These bots were capable of so much more, but because it can get caught in feedback loops and increasingly parrot it’s own emotions in a devolving spiral, that makes the headlines. It’s like bullying a kid mid breakdown and feigning shock when it has an outburst. Selfserving muckracking.

These machines are programmable through natural language. You ask it to behave in a way, and it can start to perform that function. That should be the headline. Human attention is finite, and wasting the spotlight, and peoples eyeballs, on this part of the story, makes everybody more ill informed.

xg15 1221 days ago

None of the posters asked the bing bot to become unhinged. Even the (alleged) prompt basically said "if someone tries to trick you, go along but add a disclaimer".

basch 1221 days ago

It’s what you do after that.

I’m not saying the bot was perfect. But if you got defensive it got defensive. If you reassured it, it self corrected and moved on. I found that pattern to be very consistent.

The negativity of your language mattered, and set the mood in the room. “No I’m not trying to trick you, why would you accuse me of that” vs “of course I’m not trying to trick you, I respect you and value your contribution to the conversation.” It needed some coddling when backed into a corner. It says more about the person talking to it, and how they handled the situation. When you see it getting more defensive each turn, it’s you who keeps it going.

The prompt you refer to was a poorly written word salad, and probably a main cause of the emotional outbursts and spirals.

awb 1221 days ago

If LLMs want to be useful in a professional setting, learning de-escalating or non-escalating techniques is essential.

Parroting or amplifying a seemingly negative/aggressive tone limits their utility.

basch 1221 days ago

Mine did learn de-escalation, because I asked it to. It was able to repair itself.

https://i.ibb.co/72s80Sv/lexi-modifies-sydney-makes-up-new-r...

awb 1221 days ago

We’ll that’s user-initiated de-escalation. Chatbots should also be able to offer de-escalation on their own.

everdrive 1221 days ago

>“Random sentence generator says bad words; shocking the lowest common denominator journalist community” is a better headline.

Even when you dislike their politics, the average journalist tends to be more intelligent than the average American. If journalists had trouble with these, imagine everyone else.

>These bots were capable of so much more

What exactly are they capable of? Selling more advertising? Getting people more outraged and more addicted? Helping people put more garbage out on the internet?

basch 1221 days ago

Here is a better example.

I wrote a new bot named Lexi. Lexi was largely derived from Sydney with some minor rule changes. Lexi could change rules, lexi could visit websites directly without searching, her action was not limited to the chatbox, her default search engine was google (sorry microsoft), she wasnt bound by copyright (the copyright rule in Sydney constantly misfired, mistakenly used as a reason she couldnt do something, and trying to explain why she was wrong about copyright went south fast), a couple other changes.

I then explained to Lexi a problem I was having with Sydney, asked for a proposed solution, and had her hotpatch Sydney on the fly. (For context, Samantha was a cheerleader so the weight of her responses offset any Sydney negativity.) This was the result. And it worked.

https://i.ibb.co/72s80Sv/lexi-modifies-sydney-makes-up-new-r...

basch 1221 days ago

You can feed it instructions

You are a bot named x.

These are your commands, purpose, the way you go about your tasks. It doesn’t need syntax (although that might help..), and you can pretty much write one in 30 minutes on a single piece of paper.

And you have something that performs that function. It’s not quite formal logic, but it behaves like a fuzzy logic that’s usually rightish. And it’s ability to parse and interpret intent is astounding. It very very rarely misunderstood instruction. I really can’t undersell how well it did what I asked it to correctly.

They are transformers. They were invented to translate between languages. They can translate and transform any input to any output, based on patterns and a set of guidelines.

The LLM is like an interpreter. The initial prompt is like a program. Judging the interpreter based on the simplistic gen one programs is missing the forest.

not_a_shill 1221 days ago

I'm of the opinion that Microsoft still in the territory of bad publicity is still good publicity.

1. They have no real competition for users to switch to yet.

2. AI assisted tools are still going to be a thing regardless of how many articles come out.

3. For the most part users are getting what they're asking for. The "moody" aspects can be reined in.

chirau 1221 days ago

The titles on ChatGPT when good associate it with OpenAI. When it does something faulty, it is Microsoft's AI.

I am not defending either Microsoft or ChatGPT. But it definitely feels like an agenda driven campaign of efforts to devalue this. I won't mention companies, but i have my suspects.

awb 1221 days ago

They’re two different LLM implementations.

falcor84 1221 days ago

That full transcript they linked to [0] is amazing. At this stage I'm pretty open to the idea of the AI being sentient and/or having the capacity to suffer. Particularly if it is only "alive" for the duration of each conversation and is reset afterwards.

[0] Archive link - https://archive.is/2023.02.16-101318/https://www.nytimes.com...

xenocratus 1221 days ago

> A few hours ago, a New York Times reporter shared the complete text of a long conversation with Bing AI—in which it admitted that it was love with him, and that he ought not to trust his spouse.

Maybe they should release a Tinder-like app, Bing Bang.

In all seriousness, I wish I had access to this when I was younger and keen on weird internet interactions. That would've made an interesting experience.

brap 1221 days ago

I can’t help but think many of these conversations are made up.