The model does have the format specified but there is no _one_ standard. For this model it’s defined in the [
tokenizer_config.json [0]. As for llama.cpp they seem to be using a more type safe approach to reading the arguments.
Hm, but surely there will be converters for such simple formats? I'm confused as to how there can be calling bugs when the model already includes the template.