Hacker News new | ask | show | jobs
by happymellon 824 days ago
I have yet to find a text to image generator that works. Whenever I try to generate an image, for example something like Christmas cake, I get

Google image search: The results are pictures of that nice dark cake filled with currents, raisens and cherries.

TTI: Have a cheese cake, or perhaps a gingerbread house?

They seem to be able to do all sorts of great images, but never what I want.

2 comments

what is a Christmas cake.

you are asking it to do something humans cant.

Christmas cake is just not enough context and it isn't a "thing".

i would give you a picture of Christmas themed cake.

This is a larger issue with using AI in general. You have to be able to communicate efficiently.

Imagine asking a random person for a Christmas cake. Would you really expect to find the right thing?

> you are asking it to do something humans cant.

What are you talking about? Google can do it.

https://postimg.cc/mhn6SPVC

For reference:

https://www.bbcgoodfood.com/recipes/make-mature-christmas-ca...

Fruit cake is also something that these image generation tools cannot create. They come up with all sorts of sponge cake monstrosities.

Uhh google showed you a bunch of different things that a bunch of different people call Christmas cakes?

Again fruit cake lacks all context. The definition I. Your head is not contained in the word itself.

This is why people get confused even talking to each other…

Eh? Perhaps this is cultural? I wasn’t aware Christmas cake was a thing but if I had to guess it would just be normal cake with green, red, white colors/icing and Christmas themed decorations. Sure enough, that’s most of what shows up on Google image search. It’s also exactly what SD-XL outputs in my (limited) testing. It doesn’t surprise _me_ too much that a text to image model struggles with that concept because it feels rare and under specified. Having said that, I live in the US south and maybe I’m just ignorant to all that. We mostly eat various pies here.