by
danielbln
30 days ago
A3B is especially nice; MoE really shines on memory-bandwidth-constrained platforms like the DGX Spark.
verdverm
30 days ago
Looks like MTP support has now been merged, and the unsloth quants have been updated to go with it (not just the extras, all of 'em!)