|
|
|
|
|
by kosolam
22 days ago
|
|
Hey, this is most probably related to the chat template or the reasoning parser or the tool call parser or also things like kv cache quantization and possibly other params that affect results like the regular top k top p and all of that, the backend often sets its own defaults or the lack of them. It’s best to have all these under control if possible. I wonder regarding this project have you been testing it on real world projects? I’m working on an agentic loop as well also using a local model. |
|
As for consumers, I've done a home assistant, an agentic coding harness, and an autonomous engineering project (still in flight).