Hacker News new | ask | show | jobs
by mattway 1776 days ago
The text to image results look pretty awesome, but I feel like they all have a "neural net" quality to them. Whilst the style transer results come out more unique. Do you think it would become possible to combine the two techniques? Eg. Generate me a pic of this text, but in this images style?
2 comments

> but I feel like they all have a "neural net" quality to them

it is getting significantly better though! The first AI art generators were just dreams now we're getting decent stuff as long as you stick to nature concepts.

Here's one I got for:

"a starry, dark night sky over vast plains of desert"

https://creator.nightcafe.studio/creation/70lx2Il45nK1rmcE1D...

it's definitely getting there!

The Colab notebooks actually have a "target image" parameter. I haven't added it to NightCafe Creator yet, and haven't even experimented much with it on Colab, but it's definitely on my to-do list.

However, adding keywords like "watercolour painting", "van gogh painting" etc goes a long way to getting more precise results.

One big difference between style transfer and GAN art is that style transfer never changes the shape of things. E.g. you can't put in a woman's face and a cubist painting and get out a rearranged version of the face (as cubists do). It will have all the hallmarks of cubism, but it's still recognisable as the original photo.

With GAN art though, you can put that photo in as a start image, say "Cubist painting of a woman's face", and the GAN will actually rearrange the face like a cubist would.