Hacker News new | ask | show | jobs
by buildbot 3 days ago
The 120B and 20B GPT-OSS models by OpenAI did this last year for what it’s worth; the MoEs where MXFP4