Y
Hacker News
new
|
ask
|
show
|
jobs
by
buildbot
3 days ago
The 120B and 20B GPT-OSS models by OpenAI did this last year for what it’s worth; the MoEs where MXFP4