by danielbln 30 days ago
A3B is especially nice; MoE really shines on memory-bandwidth-constrained platforms like the DGX Spark.
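
Rough back-of-envelope for why, as a sketch under assumptions rather than a benchmark: if decode is limited by how fast the weights can be read, tokens/sec is capped at roughly bandwidth divided by the bytes touched per token, and an MoE only touches its active parameters. The numbers below are my assumptions, not from the thread: about 273 GB/s of memory bandwidth, a 4-bit quant (0.5 bytes per parameter), "A3B" read as about 3B active parameters, and a hypothetical 30B dense model for comparison. It ignores KV-cache reads, router overhead, and compute limits.

```python
def decode_tokens_per_second(bandwidth_gb_s, active_params_billions, bytes_per_param):
    """Upper bound on decode speed when reading weights is the bottleneck.

    Each generated token requires reading every *active* parameter once,
    so the ceiling is memory bandwidth / bytes read per token.
    """
    bytes_per_token = active_params_billions * 1e9 * bytes_per_param
    return bandwidth_gb_s * 1e9 / bytes_per_token


# Assumed figures for illustration only (not taken from the thread).
BANDWIDTH_GB_S = 273.0   # assumed platform memory bandwidth
BYTES_PER_PARAM = 0.5    # assumed 4-bit quantization

# MoE: only ~3B parameters are active per token (my reading of "A3B").
moe = decode_tokens_per_second(BANDWIDTH_GB_S, 3.0, BYTES_PER_PARAM)

# Hypothetical dense model of 30B parameters: all of them are read per token.
dense = decode_tokens_per_second(BANDWIDTH_GB_S, 30.0, BYTES_PER_PARAM)

print(f"MoE ceiling:   {moe:.1f} tok/s")
print(f"Dense ceiling: {dense:.1f} tok/s")
print(f"Ratio:         {moe / dense:.1f}x")
```

Real throughput will land below these ceilings, but the ratio is the point: with the same bandwidth, the ceiling scales with the inverse of the active parameter count.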
1 comment

Looks like MTP support has now been merged, and the unsloth quants have been updated to go with it (not just the extras, all of 'em!)