Hacker News new | ask | show | jobs
1.5x Faster Moe Training with Custom MXFP8 Kernels (cursor.com)
1 points by sshroot 283 days ago