	by oshrimpton 2 hours ago
	I'd definitely agree that model size isn't the direct cause, but a model with a larger parameter count does need a correspondingly large amount of training data to avoid overfitting or underfitting. So I think this race to the top of "max training data size" has led to some unintentional overfitting: not catastrophic, but enough to trigger this perceived omniscience in the model.