Hacker News new | ask | show | jobs
New Recipe: Serving Llama-2 with VLLM's OpenAI-Compatible API Server (github.com)
1 points by zhwu 1028 days ago