by KasianFranks 390 days ago
This is also where MoE (mixture of experts) shines, with a mixture of small and large language models.