by zingelshuher 810 days ago
Intuitively, it looks like the models should be close enough, or sparse enough, for a merge to work. I wonder if MoE experts can be merged?