Hacker News
by trebligdivad 207 days ago
Even when running locally, MoE models are just so much faster, and they let you pick a larger or less-quantized model and still get a useful speed.
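A rough back-of-envelope sketch of why that tends to hold, assuming decoding is memory-bandwidth bound and each generated token has to read every active weight about once. The function name and all the numbers below are hypothetical, picked only for illustration; real throughput also depends on KV cache reads, routing overhead, and the runtime.

```python
def decode_tokens_per_sec(active_params_b: float,
                          bits_per_weight: float,
                          bandwidth_gb_s: float) -> float:
    """Crude upper bound on decode speed for a bandwidth-bound model.

    active_params_b: parameters touched per token, in billions
                     (all of them for a dense model, only the routed
                     experts plus shared layers for an MoE).
    bits_per_weight: quantization level (e.g. 4 or 8).
    bandwidth_gb_s:  memory bandwidth in GB/s (assumed, not measured).
    """
    bytes_per_token = active_params_b * 1e9 * bits_per_weight / 8
    return bandwidth_gb_s * 1e9 / bytes_per_token


# Hypothetical comparison on the same 100 GB/s machine:
# a dense 30B model at 4-bit vs. an MoE with 3B active params at 8-bit.
dense = decode_tokens_per_sec(active_params_b=30, bits_per_weight=4, bandwidth_gb_s=100)
moe = decode_tokens_per_sec(active_params_b=3, bits_per_weight=8, bandwidth_gb_s=100)

print(f"dense: {dense:.1f} tok/s, MoE: {moe:.1f} tok/s")
```

Under those assumptions the MoE comes out several times faster even at the less aggressive quantization, which matches the point above. The catch is that the full set of experts still has to fit in memory; only the per-token reads shrink.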