Hacker News
DeepSeek V4 in vLLM: Efficient Long-Context Attention (vllm-website-pdzeaspbm-inferact-inc.vercel.app)
2 points by Palmik 57 days ago