Hacker News
by immibis 266 days ago
Batching is just averaging the gradients from multiple per-example calculations; the batch size is how many of them you average.
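To make that concrete, here is a minimal sketch, assuming a toy one-parameter model y ≈ w * x with squared-error loss. The model, the data, and the names (`per_example_grad`, `mean_loss`) are made up for illustration and not taken from any particular framework. It checks that averaging the per-example gradients gives the same number as differentiating the mean loss over the batch, and that averaging two equal-sized micro-batches gives that number too.

```python
# Toy model (an assumption for illustration): y ≈ w * x, squared-error loss.

def per_example_grad(w, x, y):
    """Analytic d/dw of (w*x - y)**2 for one example."""
    return 2.0 * (w * x - y) * x

def mean_loss(w, xs, ys):
    """Mean squared error over the whole batch."""
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

w = 0.5
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

# 1) Average of the gradients from separate per-example calculations.
grads = [per_example_grad(w, x, y) for x, y in zip(xs, ys)]
averaged = sum(grads) / len(grads)

# 2) Gradient of the batch's mean loss, via a central finite difference.
h = 1e-4
numeric = (mean_loss(w + h, xs, ys) - mean_loss(w - h, xs, ys)) / (2 * h)

# 3) Two micro-batches of size 2, each averaged, then averaged together.
micro_a = sum(grads[:2]) / 2
micro_b = sum(grads[2:]) / 2
accumulated = (micro_a + micro_b) / 2

print(averaged, numeric, accumulated)
```

The micro-batch shortcut in step 3 only matches exactly because both halves are the same size; with unequal sizes you would need to weight each half by its example count.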