Hacker News new | ask | show | jobs
by londons_explore 632 days ago
Transformers run well on GPU's or other hardware accelerators. This benchmark doesn't allow GPU's.

That makes it more of a "can I use unsuitable hardware to get the job done fast and accurately enough" challenge, rather than a pure math puzzle of how to encode data with fewer bytes.

I suspect that's why there is only 1 Transformer entry, and to me raises the question whether the rules should be updated to allow GPU's now they are fairly commonplace.

1 comments

I think it might be a moot point since the transformer run times scale very poorly and the algorithm has a symmetric run time.