	by sillysaurusx 1700 days ago
	It's hilarious that not even this GPT side project can escape the need for OpenAI-style output filtering. Someone really ought to make an open source lib to solve this once and for all...

4 comments

jazzyjackson 1700 days ago

filtering objectionable content actually requires you to build a strong AI model capable of being offended itself, so it knows to hold its tongue in mixed company

edit: lest i leave this comment totally useless, the chatbot engine “ChatScript” has pretty good capabilities for disambiguating word meaning and classifying the meanings into “badword” and “verybadword”. It's free/libre software and very high performance.

https://github.com/ChatScript/ChatScript/

mensetmanusman 1700 days ago

Objectionable is subjective.

lanstin 1700 days ago

So is skillful language use. We nonetheless think it is achievable without using wetware.

lindseymysse 1699 days ago

Or, better yet: we should all work to make sure every human being has their basic needs met and is treated with respect, and then words like this wouldn't have as much power.

mattnewton 1700 days ago

I think actually solving this problem is loosely reducible to human-level intelligence in language understanding. The next best thing is a pile of patches, each fixing a case of the human adversaries' increasing creativity against the system as we become aware of it.

air7 1699 days ago

You should aim higher than that. After all, it was human-level intelligence that got a professor fired for saying "nèige", a Mandarin filler word, in a lecture about filler words in other languages!

https://www.lamag.com/citythinkblog/usc-professor-slur/

wodenokoto 1699 days ago

I don’t have access to gpt-3 but have wondered how much adding “without being offensive” to the prompt would help.

On one hand it’s the promise of the algorithm that such things will work, on the other hand I doubt it.

cookieswumchorr 1699 days ago

A simple blocklist might just be the cheaper and easier solution. After all, blocklists were, and still are, used to filter human output as well. And they're fail-safe. A person with a hobby project does not have a legal/PR department to deal with the consequences of an AI having a bad day.
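
For illustration, here is a minimal sketch of what such a blocklist filter could look like. Everything in it is an assumption made for the example: the names `BLOCKLIST`, `FALLBACK`, `is_blocked`, and `filter_output` are hypothetical, and the two list entries are placeholders borrowed from the category names mentioned upthread, not a real word list. It fails closed, meaning any match withholds the whole output.

```python
import re

# Hypothetical placeholder terms; a real list would be curated by hand.
BLOCKLIST = {"badword", "verybadword"}

# Hypothetical replacement shown when the output is withheld.
FALLBACK = "[output withheld]"

# Matches runs of letters only (no digits, underscores, or punctuation).
_WORD_RE = re.compile(r"[^\W\d_]+")


def is_blocked(text, blocklist=BLOCKLIST):
    """Return True if any whole word in text is on the blocklist.

    Matching is case-insensitive and whole-word only, so a listed
    term embedded inside a longer word does not count as a match.
    """
    words = (w.casefold() for w in _WORD_RE.findall(text))
    return any(w in blocklist for w in words)


def filter_output(text, blocklist=BLOCKLIST, fallback=FALLBACK):
    """Fail closed: withhold the entire output if any word matches."""
    return fallback if is_blocked(text, blocklist) else text


if __name__ == "__main__":
    print(filter_output("a perfectly fine sentence"))
    print(filter_output("this has a BadWord! in it"))
```

Whole-word matching like this is cheap and predictable, but it only catches exact listed terms: creative misspellings slip through, and innocent uses of a listed word get blocked anyway.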
