Hacker News new | ask | show | jobs
by qclibre22 1490 days ago
See the paper here : https://gweb-research-imagen.appspot.com/paper.pdf Section E : "Comparison to GLIDE and DALL-E 2"
1 comments

Imagen seems better at capturing details/nuance from the prompt, but subjectively the DALLE-2 images feel more “real” to me. Not sure why. Something about the lighting?
That feels about right. Imagen has a better text processing model, so it can tease apart the prompt, but DALLE has a rocking image part.