| HN Mirror


	by 6gvONxR4sf7o 808 days ago
	The tensor cores that do the bulk of the FLOPs on most of the GPUs people use just operate on various sizes of floats, I think. We're in a funny position where progress in models and progress in hardware are kind of linked. As far as expressive power goes, it shouldn't make a difference for the models in common use, but I could totally imagine models where it improves readability.