Hacker News new | ask | show | jobs
by dns_snek 844 days ago
Other models have biases in outputs and conform to stereotypes, but when used as designed, as a creative tool, that's not a real issue as long as you can correct the output with a simple prompt adjustment.

The unique issue with Gemini is that it would flat out refuse to follow simple prompts such as "Please generate an image of a white family" because they "weren't diverse and inclusive" enough, but if you changed "white" to any other qualifier, it happily obliged.

4 comments

Open Ai had this same issue when they came out because they modified prompts to always include words to generate a diverse set of races.

They corrected the guidance in their prompt instructions. It became a non issue.

OpenAI isn't involved in the daily lives of over a billion people. Google's mistakes, for now, have much bigger impact and the problem is especially egregious as the company has near-infinite resources for preventing it.

Also, Google has always adopted a fairly radical and political stance in the DEI subject (relative to the cultural average pretty much everywhere), so it's no surprise that people are making a much bigger deal in this case.

I appreciate the explanation, thank you
If you ask it for any kind of family picture, other than White, eg Hispanic, Black, Asian it acts like it should.

Ask for White, and you get a picture without a single Caucasian.

Deity forbid you ask for a Caucasian family though - their A.I. police will pull you over for sensitivity training.

It's hard to see the engine as anything other than hot garbage after a few simple tests like that.

Hey, look at the first website returned in "normal" google for "Please generate an image of a white family": https://www.alamy.com/stock-photo/white-american-family.html...

And Bing? https://www.gettyimages.ch/fotos/white-family

OTOH, if you think that "Please generate an image of a white family" is good use of those behemoth language models and all the rsssources poured in...

The first site is just simple search for all the words in the photo title separately without any connective reasoning.

The 2nd is of a family at white tiger farm. The 3rd is of people wearing white. The 4th photo is of a white dog. The 5th a white ostrich.

"If you don't like it, use something else". Gee, thanks, but we were discussing its flaws.