Hacker News new | ask | show | jobs
by Despoisj 3368 days ago
This is a very naive question, but considering the artifacts you have on the GAN-generated images, you are trying to generate the whole "palette image" at a certain resolution.

For which reasons did you not use a 5x1 pixel RGB image as the target, that would have been orders of magnitude faster to train?

Thanks!

1 comments

that's a great question! The main reason is that pix2pix works at a set resolution by default, and changing it is non-trivial. I'm actually not sure how to structure the Unet for a 5x1 image - if you have any ideas let me know (I'm a beginner at this)

Speed improvement also doesn't scale linearly with input size. Eg. there's only a 5x improvement going from 256x256 to 256x1