Hacker News new | ask | show | jobs
MoE expert co-activations: Reordering inputs yields easy throughput gains (blog.doubleword.ai)
2 points by kkm 7 days ago