Hacker News
by Ar-Curunir 1750 days ago
If it works better for inference, it could enable fast inference on devices that don't have good tensor cores or GPUs.