by rrobukef 1382 days ago
Your model is overflowing or underflowing, which generates NaNs. I got it working with the memory-optimised version, an increased resolution (a multiple of 32, 384 x 384) and full precision, while keeping it within 4 GB.
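To illustrate what I mean, here is a rough sketch in plain NumPy (not the model's actual code) of how half precision overflows to inf and then turns into NaN, while full precision handles the same values fine. The function names `fp16_failure_demo` and `round_down_to_multiple` are made up for this example, and the second one just shows one way to snap a resolution to a multiple of 32.

```python
import numpy as np

def fp16_failure_demo():
    """Show float16 overflow/underflow next to the float32 result (illustrative only)."""
    big = np.float16(60000.0)  # close to the float16 maximum of 65504
    with np.errstate(over="ignore", invalid="ignore"):
        overflowed = big * np.float16(2.0)   # exceeds the float16 range, becomes inf
        nan_value = overflowed - overflowed  # inf - inf is NaN
    underflowed = np.float16(1e-8)           # below the smallest float16 subnormal, becomes 0
    full = np.float32(60000.0) * np.float32(2.0)  # float32 has plenty of range
    return overflowed, nan_value, underflowed, full

def round_down_to_multiple(value, base=32):
    """Snap a requested size down to a multiple of `base` (hypothetical helper)."""
    return max(base, (value // base) * base)

if __name__ == "__main__":
    print(fp16_failure_demo())
    print(round_down_to_multiple(400))
```

Once a single NaN appears in an intermediate result it spreads through every later operation, which is why switching to full precision can fix the output at the cost of more memory.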