Hacker News new | ask | show | jobs
by 6gvONxR4sf7o 761 days ago
The tensor cores that do the bulk of the flops on the bulk of the gpus people use are just various sizes of floats, i think. We're in a funny position where progress in models and progress in hardware are kind of linked.

As far as expressive power goes, it shouldn't make a difference for the models in common use, but I could totally imagine models where it improves readability.