Y
Hacker News
new
|
ask
|
show
|
jobs
by
zingelshuher
810 days ago
Intuitively looks like models should be close enough, or sparse enough for merge to work. I wonder if MoE experts can be merged(?)