Hacker News
by danial 496 days ago
https://archive.ph/Dy9An

I wonder if this is also a CUDA-bypassing PTX optimization, like the one reported to be behind DeepSeek's 10x performance gain: https://xyzlabs.substack.com/p/deepseeks-latest-shocker-who-...