Hacker News new | ask | show | jobs
by memossy 846 days ago
Training on 4096 v5es how did you handle crazy batch size :o