Hacker News new | ask | show | jobs
by bitshiftfaced 920 days ago
I wonder if Google sees MoE as a sort of local maxima, and so they tried a different path hoping it might outperform it.