| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pvarangot 637 days ago
	It's because it's probably trained with "professional audio", ads, movies, audiobooks, and not "normal people talking". Like the effect when diffusion was mostly trained with stock photos.