Ask HN: How do companies make their AIs "woke"?

Y	Hacker News new \| ask \| show \| jobs

	Ask HN: How do companies make their AIs "woke"?
	1 points by litetime 847 days ago
	It's a common quip to say that LLMs are black boxes and that we have no idea how they "think". Of course that was always an exaggeration, but there is some truth to in that answers are generated from billions of miniscule connections between entities that are nearly impossible to reason about. How exactly then are they able to turn up and down the woke slider for their models? It seems impossible to tune them manually for a near infinite variety of use cases. But I could be wrong. Anyone have an intuitive explanation of how they get it done? Edit: I wasn't trying to use the word "woke" to make any sort of political statement, although I see now how maybe it wasn't the best choice of words. The impetus for this post is the recent controversy around Google's Gemini being, uhhh, "overly tuned?".

5 comments

proc0 847 days ago

For the most part it's probably reinforcement learning from human feedback (RLHF). This incorporates humans in the training loop and its done for alignment purposes (which is overall a good idea, but it does depend on who exactly the AI is aligning with). There may also be other areas where human bias can seep in, like the massaging of the training data, but more likely the biggest factor is the direct feedback training done by a select number of people.

https://aws.amazon.com/what-is/reinforcement-learning-from-h...

link

nonrandomstring 847 days ago

"woke" isn't a great qualifier. But you could re-frame the question as how do "AI" models encode and search within a political sentiment space? Here's some political spaces, usually in 4, 6, 9 and 12 dimensional variants [0..2] There are also personality axes, again with high or low dimensional nuance. Unlike "real" signals there's no component analysis to prove that these are orthogonal. If the training data carries-in any salient features you can get an NN to tell you where "woke" or "fascist" or whatever is within these for some task, then minimise or maximise it for some quality.

[0] https://www.thebehavioralscientist.com/glossary/political-co...

[1] https://politicaltests.github.io/12axes/

[2] https://9axes.github.io/

[3] https://en.wikipedia.org/wiki/Personality

link

bell-cot 847 days ago

I kinda suspect they're over-cooking the training data.

But with the ability of (so-called) AI's to hallucinate fictitious legal cases and such - pretty undesirable behaviors from either end of the political spectrum - I would not rule out "AI's are just too stupid to know better".

link

EchoChamberMan 847 days ago

Google tried to use prompts to force specific output. the companies use prompts, weights, and humans to try to keep the output sane.

link

allears 847 days ago

Please define 'woke.' As far as I know, it's simply an insult used by Republicans against Democrats.

link

litetime 847 days ago

Sorry I don't mean it as insult or to stir up any political arguments. I guess I could have said "personality", but I was motivated to understand this from the Google Gemini controversy.

link

mtmail 847 days ago

There are attempts to define it https://simple.wikipedia.org/wiki/Woke but I don't understand what the relevance to AI is.

link

michaelmrose 847 days ago

It started out having a specific positive meaning of being aware of social injustice and has in the hands of conservatives transitioned to meaning virtually anything I don't like that could even vaguely be attached however tangentially politically to anything left of Hitler or involving anyone except a white conservative man or his christian conservative wife and obedient white children.

If your minority waitress brings you burned toast it might be woke.

link