Hacker News new | ask | show | jobs
by woodrowbarlow 140 days ago
here's an example of how model censorship affects coding tasks: https://github.com/orgs/community/discussions/72603
4 comments

Oh, lol. This though seems to be something that would affect only US models... ironically
Not sure if it’s still current, but there’s a comment saying it’s just a US location thing which is quite funny. https://github.com/community/community/discussions/72603#dis...
This is called ^ deflection.

Upon seeing evidence that censorship negatively impacts models, you attack something else. All in a way that shows a clear "US bad, China good" perspective.

This is called ^ deflection.

Upon seeing evidence that censorship negatively impacts perception of the US, you attack something else. All in a way that shows a clear "China bad, US good" perspective.

>All in a way that shows a clear "China bad, US good" perspective

Nope.

My comment was neutral on the US - I emphasized the detriment of censorship on models which is true insofar as the US censors models.

Whereas the other comment moves the goalposts from "i dont necessarily see the problem with censorship on models" when presented evidence to the contrary by attacking the US. It was never about US vs. China (to anyone else). That's the deflection.

I can't believe I'm using Grok... but I'm using Grok...

Why? I have a female sales person, and I noticed they get a different response from (female) receptionists than my male sales people. I asked chatGPT about this, and it outright refused to believe me. It said I was imagining this and implied I was sexist or something. I ended up asking Grok, and it mentioned the phenomena and some solutions. It was genuinely helpful.

Further, I brought this up with some of my contract advisors, and one of my female advisors mentioned the phenomena before I gave a hypothesis. 'Girls are just like this.'

Now I use Grok... I can't believe I'm saying that. I just want right answers.

You conversely get the same issue if you have no guardrails. Ie: Grok generating CP makes it completely unusable in a professional setting. I don't think this is a solvable problem.
Why does it having the ability to do something has mean it is ‘unusable’ in a professional setting?

Is it generating CP when given benign prompts? Or is it misinterpreting normal prompts and generating CP?

There are a LOT of tools that we use at work that could be used to do horrible things. A knife in a kitchen could be used to kill someone. The camera on our laptop could be used to take pictures of CP. You can write death threats with your Gmail account.

We don’t say knives are unusable in a professional setting because they have the capability to be used in crime. Why does AI having the ability to do something bad mean we can’t use it at all in a professional setting?

Because corporate aint gonna approve the vendor that lets their dumbass employees generate child porn when the other vendors do not.
This still doesn’t make any sense. If they are worried a dumbass employee would generate CP using the tool, wouldn’t that same employee also download it from the web? Or use a web based tool to generate it?

Any employee that is going to use a corporate AI tool to generate CP is going to use other corporate tools to do worse things. There is no point in worrying about it.

Your boss tells you to choose a vendor for your AI integration. Your options:

Company A - First to the market, Reasonable Cost, Most well known name. Very easy integration.

Company B - Well regarded tools, Higher cost, Better performance and reviews from team. More difficult to integrate

Company C - Reasonably Priced, Performance is reasonable, Has a connection to an extremely controversial individual, Currently being lambasted for being an CP/Revenge Porn generator

Ok, now pretend you're talking to a guy who signs your paycheques. Which one are you NOT gonna pick?

I'm struggling to follow the logic on this. Glocks are used in murders, Proton has been used to transmit serious threats, C has been used to program malware. All can be legitimate tools in professional settings where the users don't use it for illegal stuff. My Leatherman doesn't need to have a tipless blade so I don't stab people because I'm trusted to not stab people.

The only reason I don't use Grok professionally is that I've found it to not be as useful for my problems as other LLMs.

> Ie: Grok generating CP makes it completely unusable in a professional setting

Do you mean it's unusable if you're passing user-provided prompts to Grok, or do you mean you can't even use Grok to let company employees write code or author content? The former seems reasonable, the latter not so much.

Curious why you use abbreviations ? "CP", "MAP", etc just for such.
I'm lazy
ok fair enough
These gender reveal parties are getting ridicolous.