Hacker News new | ask | show | jobs
by rjvs 980 days ago
The “mixture of experts” concept in LLMs is a way of training a single model, it’s not based on training many different models (although that was the idea when the term was originally coined).