Y
Hacker News
new
|
ask
|
show
|
jobs
Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)
(
furiosa.ai
)
9 points
by
olibaw
251 days ago