Hacker News new | ask | show | jobs
Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs) (furiosa.ai)
9 points by olibaw 251 days ago