by nilsherzig
809 days ago
Thanks :) Die LLM makes these decisions on its own. If it writes a message that contains a tool call (Action: Web search Action Input: weight of a llama), the matching function is executed and its response is returned to the LLM. It's basically chatting with the tool. You can toggle the log viewer in the top right to get more detail on what it's doing and what is taking time. Timing depends on multiple things:
- the size of the top n articles (generating embeddings for them takes some time)
- the number of matching vector DB responses (reading them takes some time)
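
Roughly, the "chatting with the tool" loop can be sketched like this. It is not the project's actual code, just a minimal Python illustration: `parse_tool_call`, `run_agent`, `fake_llm` and `stub_search` are made-up names, the `Observation:` prefix is an assumption about how the tool response is fed back, and the stub search returns a placeholder instead of real results.

```python
import re
from typing import Callable, Dict, Optional, Tuple

# Matches "Action: <tool name> Action Input: <input>" anywhere in a message.
TOOL_CALL = re.compile(r"Action:\s*(.+?)\s*Action Input:\s*(.+)", re.DOTALL)


def parse_tool_call(message: str) -> Optional[Tuple[str, str]]:
    """Return (tool name, tool input) if the message contains a tool call."""
    match = TOOL_CALL.search(message)
    if match is None:
        return None
    return match.group(1).strip(), match.group(2).strip()


def run_agent(
    llm: Callable[[str], str],
    tools: Dict[str, Callable[[str], str]],
    question: str,
    max_steps: int = 5,
) -> str:
    """Let the LLM call tools until it answers without a tool call."""
    transcript = question
    for _ in range(max_steps):
        message = llm(transcript)
        call = parse_tool_call(message)
        if call is None:
            # No tool call: treat the message as the final answer.
            return message
        name, tool_input = call
        tool = tools.get(name)
        observation = tool(tool_input) if tool else f"unknown tool: {name}"
        # Hand the tool response back to the LLM (assumed format).
        transcript += f"\n{message}\nObservation: {observation}"
    return "stopped: step limit reached"


def fake_llm(transcript: str) -> str:
    """Hypothetical stand-in for a real model, for demonstration only."""
    if "Observation:" in transcript:
        return "Final answer based on the search result."
    return "Action: Web search Action Input: weight of a llama"


def stub_search(query: str) -> str:
    """Hypothetical stand-in for the web search tool."""
    return f"stub result for: {query}"
```

Calling `run_agent(fake_llm, {"Web search": stub_search}, "How heavy is a llama?")` goes through one tool call and then returns the model's final message. The step limit is there so a model that keeps emitting tool calls cannot loop forever.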

You mean "the"? The German is bleeding through haha