Y
Hacker News
new
|
ask
|
show
|
jobs
by
cat_plus_plus
167 days ago
At least for transformers, it can be kind of fixed with MOE + NVFP4 for small working set despite large resident size.