| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by sigmoid10 808 days ago
	And what do you think epochs in machine learning are? Or why more modern training efforts (i.e. for LLMs) are focussing hard on deduplicating scraped data?

1 comments

llm_trw 808 days ago

Why don't you tell me instead of asking questions that you surely know the answer for?

link

sigmoid10 807 days ago

It was rhetorical. But in case you actually don't know: what you described (i.e. multi sampling) has been common practice in ML for ages. Only now the latest models are getting so big that people are actually trying hard to move away from this idea because it would take a human lifetime in wall clock time to train a cutting edge LLM on similar datastreams.

link