by retinaros 415 days ago
That might already happen behind what they call test-time compute.

Many models that use test-time compute are MoEs, but test-time compute generally refers to reasoning about the prompt or problem the model is given, not reasoning about which model to pick, and I don't think anyone has released an LLM router under that name.
We don't know what OAI does to find the best answer when reasoning, but I am pretty sure that having variations of the same model is part of it.