


	by c0decracker 1429 days ago
	Fundamentally I see two categories of issues with DALL-E, but please don't get me wrong -- I think this is a great demonstration of what is possible with huge models, and I think OpenAI's work in general is fantastic. I will most certainly continue using both DALL-E and OpenAI's GPT3.

	(1) In my opinion there is a rift between what DALL-E can do today and commercial utility. I readily admit that I have not done hundreds of queries (thank you folks for pointing that out, I'll practice more!) but that means there is a learning curve, doesn't it? I can't just go to DALL-E, mess with it for 5-10 minutes and get my next ad or book cover or illustration for my next project done?

	(2) I think DALL-E has issues with faces and the human form in general. The images it produces are often quite repulsive and take the uncanny valley to the next level. I surprised myself when I caught myself thinking that the images of humans DALL-E produced lack... soul? Cats and dogs, on the other hand, it handles much better. I have done tests with other entities -- say cars or machinery -- and it generally performs so-so with them too, often creating disproportionate representations of them or misplacing chunks. If you query for multiple objects in a scene it quite often melds them together. This is more pronounced in photorealistic renderings. When I query for a painting style it mostly works better. That said, every now and then it does produce a great image, but with this way of arriving at it, how fast will I have to replenish those credits? :)

	All in all, though, I think I am underwhelmed mostly because my initial expectations were off. I am still a fan of DALL-E specifically and GPT3 in general. Now when is GPT4 coming out? :)