| HN Mirror

	by noogle 1221 days ago
	Models are retrained periodically (every few months, weeks, or even days), and new architectures and implementations appear all the time. If a better algorithm appears, practitioners will adopt a new platform (e.g. Transformers for NLP models), so many systems can already plug in new tools. GPUs are very expensive, so there is also a strong incentive to make this small effort.

1 comment

floatngupstream 1221 days ago

Yes, but this just makes a frictionless runtime for inference even more important (which is something that does not exist in a comparable form for AMD).
