Hacker News new | ask | show | jobs
by heavyset_go 85 days ago
When I did this, I used a single local llama.cpp server instance as my main model without setting a small model and it did not use it for chat titles while I used it for prompts.

Chat titles would work even when the local llama.cpp server hadn't started, and it was never in the the llama.cpp logs, it used an external model I hadn't set up and had not intended to use.

It was only when I set `small_model` that I was able to route title generation to my own models.

1 comments

Maybe it was for older version?