Hacker News new | ask | show | jobs
by minimaxir 212 days ago
One of my tests for new image generation models is professional food photography, particularly in cases where the food has constraints, such as "a peanut butter and jelly sandwich in the shape of a Rubik’s cube" (blog post from 2022 for DALL-E 2: https://minimaxir.com/2022/07/food-photography-ai/ )

For some reason ever since DALL-E 2, all food models seem to generate obviously fake food and/or misinterpret the fun constraints...until Nano Banana. Now I can generate fractal Sierpiński triangle peanut butter and jelly sandwiches.

2 comments

I tried having Claude generate a prompt for Seedream and got this: https://imgur.com/a/6xX5TDE

I can kind of see what you mean in that it went for realism in the aesthetics, but not the object... but that last one would probably fool me if I was scrolling

Here's Nano Banana Pro, which nailed it, IMO, but I had to fudge the prompt:

https://imgur.com/8kMqbBO

"peanut butter, jelly and bread rubik's cube. each smaller cube in the rubik's cube is one ingredient, randomly selected. professional food photography style. ensure it looks like a working rubik's cube"

Those are better than usual: I've gotten generations from earlier models that are just a normal colorful Rubix's cube between two slices of bread.
Nano-Banana does a (inter)stellar job with food based prompts.

https://mordenstar.com/portfolio/wontauns

Make you wonder if they're using all the restaurant review photos on google maps to train