Hacker News new | ask | show | jobs
Toward Inference-Optimal Mixture-of-Expert Large Language Models (arxiv.org)
24 points by zhiQ 804 days ago