I was busy adding `chat template` support to vLLM recently, so the model (and any others that implement it properly) will work seamlessly with a clone of the OpenAI chat/completions endpoint.
https://github.com/vllm-project/vllm/pull/1756
Now that I have that out of the way, back to model training ;).
I was busy adding `chat template` support to vLLM recently, so the model (and any others that implement it properly) will work seamlessly with a clone of the OpenAI chat/completions endpoint.
https://github.com/vllm-project/vllm/pull/1756
Now that I have that out of the way, back to model training ;).