| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by shenberg 906 days ago
	I suspect that weight initializations are geared towards inputs being normal random variables with mean 0 and variance 1. Deviating from that makes the learning process unhappy.