Hacker News new | ask | show | jobs
by fchaubard 539 days ago
Yes it will allow stable training at much smaller batch sizes. Test it out and let us know if it works for your use case!