Hacker News
by onoesworkacct 88 days ago
Many LLMs already use mixture-of-experts models. If you ensure the neurons are all glued together, then (I think) you train language and reasoning simultaneously.