| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by not2b 1700 days ago

You might need some filtering. I got the following output for my 7th or 8th try (I did s/ig/*/g because I hate typing this):

noun.

n*gerbar

n*·ger·bar

a bar from which the beer and liquor are sold

"that party got a whole lot cooler this week with a n*gerbar refreshment"

a word that does not exist; it was invented, defined and used by a machine learning algorithm.

3 comments

turtlesoup 1700 days ago

Ah crap, good call. I did a first-pass at filtering for the ones displayed on the site but my regex must have had some misses.

link

growt 1700 days ago

Not as bad, but also probably something for a "blocklist": cockshop a shop selling flannel and scarves

link

sillysaurusx 1700 days ago

It's hilarious that not even this GPT side project can escape the need for OpenAI-style output filtering.

Someone really ought to make an open source lib to solve this once and for all...

link

jazzyjackson 1700 days ago

filtering objectionable content actually requires you to build a strong AI model capable of being offended itself, so it knows to hold its tongue in mixed company

edit: lest i leave this comment totally useless, the chatbot engine “chatscript” has pretty good capabilities for disambiguating word meaning and classifying the meanings into “badword” and “verybadword” - its free/libre software and very high performance.

https://github.com/ChatScript/ChatScript/

link

mensetmanusman 1700 days ago

Objectionable is subjective.

link

lanstin 1700 days ago

So is skillful language use. We nonetheless think it is achievable without using wetware.

link

lindseymysse 1699 days ago

Or, better yet: we should all work to make sure every human being has their basic needs met and is treated with respect, and then words like this wouldn't have as much power.

link

mattnewton 1700 days ago

I think actually solving this problem is somewhat loosely reducible to human level intelligence in language understanding. The next best thing is a pile of patches that fix cases of the increasing creativity of the human adversaries to the system as we become aware of them.

link

air7 1699 days ago

You should aim higher than that. After all it was human level intelligence that got a proffesor fired for saying "nèige", a Mandarin filler word, in a lecture about filler words in other languages!

https://www.lamag.com/citythinkblog/usc-professor-slur/

link

wodenokoto 1699 days ago

I don’t have access to gpt-3 but have wondered how much adding “without being offensive” to the prompt would help.

On one hand it’s the promise of the algorithm that such things will work, on the other hand I doubt it.

link

cookieswumchorr 1699 days ago

a simple blocklist might be just the cheaper and easier solution. After all, blocklists were, and still are being used to filter human output as well. And fail-safe. A person with a hobby project does not have a legal/pr department to deal with the consequences of AI having a bad day

link

throw10920 1700 days ago

(you might want to read up on the use-mention distinction https://en.wikipedia.org/wiki/Use%E2%80%93mention_distinctio...)

link

asxd 1700 days ago

I have a feeling the parent poster realized they were only mentioning the word, but chose not to write it anyway. For a lot of people, it is a very uncomfortable word after all.

link

gberger 1700 days ago

Please could you explain the relevance of the article to the previous comment?

link

not2b 1700 days ago

Perhaps throw10920 thinks I should have just used the word because I was only quoting the output, so it wasn't really me saying it, but I chose otherwise. I don't need to read up on the use-mention distinction to decide whether to make such a choice.

link