| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by fragmede 922 days ago
	I'm not going to argue that there isn't a difference when going from 0 -> 1 and 1 -> 10, or in this case, from 1.5b (Whisper-large) -> 1.7t parameters (gpt-4). But it's not like we don't know how to do it, so it won't take 10 years to get there.