| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by jeeceebees 2399 days ago

I think the approach is really cool but the processing time required is too much for this to be very useful at the moment.

On a 1080 Ti it takes 45-90 minutes to train networks for the various tasks on 256px images (depending on some quality parameters and which task). Each task also requires training individually so if you'd like to try them all for a given image you'll need to train 6 times.

Also the pyramid of GANs approach is very memory hungry. I was only able to get up to 724px images with 11 GB of VRAM. This was also only possible with a higher scale factor (sparser pyramid) which sacrifices a lot of quality and is incredibly noticeable at larger image sizes. I only tried for larger sizes with the animation task though, perhaps there is a way to combine the super resolution and animation task and achieve better results. Training on larger sizes was taking upwards of 6-8 hours.

All of this was tested with the official repo[1] about a month ago.

[1] https://github.com/tamarott/SinGAN

1 comments

spunker540 2399 days ago

Could you put that in context in terms of $$? How much does it cost in aws/gcp/azure to run a 1080 ti for 90 minutes?

Would you say that could be a downside of the single image approach? Rather than feeding images into a generalized model you’re training a whole model per image which is costly to scale?

link

marcyb5st 2399 days ago

You can't run nVidia consumers cards in datacenters. You need to use the expensive versions. However, for a one-off of 90 minutes they still come cheap (like https://cloud.google.com/products/calculator/#id=cd604b5c-76... )

link