| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by coolness 1985 days ago
	This, not to mention one could get the GPU usage on the V100 way higher by training with larger batch sizes, which would also make training much faster.