by SV_BubbleTime
675 days ago
My guess, since programmer blog posts (plus autism?) tend to assume “Everyone already knows everything about my project because I do!”: is this effectively a local LLM that reads your prompt and then decides which specialized LLM to hand it off to? If so, won’t switching models back and forth add a lot of latency, since most people run the single largest model that will fit on their GPU?
This is cool because it lets you integrate with things like Home Assistant, so you can ask your chatbot or whatever to actually take actions. "Hey bot, turn on the lights in the basement," for example.
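
To make the "take actions" part concrete, here is a rough sketch of how the hand-off might look. It assumes the model replies with a JSON object naming a function and its arguments; the `turn_on_light` handler, the `TOOLS` table, and the JSON shape are all made up for illustration and are not Home Assistant's API or this project's actual format. A real setup would swap the stub for a call into your home automation system.

```python
import json

# Hypothetical handler: a stand-in for a real smart-home call.
# It only returns a message so the example runs without any hardware.
def turn_on_light(area: str) -> str:
    return f"light in {area} turned on"

# Hypothetical registry mapping function names the model may emit to handlers.
TOOLS = {"turn_on_light": turn_on_light}

def dispatch(model_output: str) -> str:
    """Parse the model's JSON function call and run the matching handler."""
    call = json.loads(model_output)
    handler = TOOLS.get(call["name"])
    if handler is None:
        raise ValueError(f"unknown function: {call['name']}")
    return handler(**call["arguments"])

# Assumed shape of what the model produces for
# "Hey bot, turn on the lights in the basement".
example = '{"name": "turn_on_light", "arguments": {"area": "basement"}}'
result = dispatch(example)
print(result)
```

The point is that the model never touches the lights itself: it only produces structured text, and ordinary code decides whether and how to act on it.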