by programmarchy
444 days ago
With OpenAI models, my understanding is that token output is constrained so that each next token must conform to the specified grammar (i.e., a JSON schema), so you're guaranteed to get either a function call or an error. Edit: per simonw's sibling comment, Ollama also has this feature.
The Gemma model by itself does not, though, and neither does any other "raw" model. Many open-source libraries exist, however, that you can plug into whatever local framework you decide to use.
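
For anyone wondering what "each next token must conform to the grammar" looks like mechanically, here is a rough toy sketch in Python. It is not how OpenAI, Ollama, or any particular library implements it. The vocabulary, the `fake_logits` stand-in model, and the `ALLOWED` list (a trivial "grammar" that just enumerates accepted token sequences) are all made up for illustration. The idea is only that, at every step, tokens that would break the grammar get their score set to negative infinity before one is picked.

```python
import math

# Toy vocabulary (hypothetical); a real tokenizer has tens of thousands of entries.
VOCAB = ['{', '}', '"name"', ':', '"get_weather"', 'hello', '<eos>']

# Hypothetical "grammar": an explicit list of accepted token sequences.
# Real systems compile a JSON schema or grammar into a state machine instead.
ALLOWED = [
    ['{', '"name"', ':', '"get_weather"', '}', '<eos>'],
]

def allowed_next(prefix):
    """Tokens that keep `prefix` a valid prefix of some accepted sequence."""
    out = set()
    for seq in ALLOWED:
        if len(prefix) < len(seq) and seq[:len(prefix)] == prefix:
            out.add(seq[len(prefix)])
    return out

def fake_logits(prefix):
    """Stand-in for a model: it always prefers 'hello', which the grammar forbids."""
    return [5.0 if tok == 'hello' else 1.0 - 0.1 * i for i, tok in enumerate(VOCAB)]

def constrained_decode(max_steps=10):
    """Greedy decoding with grammar masking: returns valid tokens or raises."""
    prefix = []
    for _ in range(max_steps):
        ok = allowed_next(prefix)
        if not ok:
            raise ValueError("no valid continuation for this prefix")
        logits = fake_logits(prefix)
        # Mask out every token the grammar does not allow at this position.
        masked = [l if tok in ok else -math.inf for tok, l in zip(VOCAB, logits)]
        best = VOCAB[max(range(len(VOCAB)), key=masked.__getitem__)]
        if best == '<eos>':
            return prefix
        prefix.append(best)
    raise ValueError("ran out of steps before the output was complete")

print("".join(constrained_decode()))
```

Left unconstrained, this fake model would emit `hello` forever. With the mask it can only ever produce the one accepted sequence or raise, which is the "function call or an error" guarantee in miniature.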