| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by charleshmartin 138 days ago
	Right. If the dynamics of training are governed by RG flow, then the best optimization path should remove redundant directions, as specified by the RG operator(s)