Hacker News new | ask | show | jobs
by joakleaf 37 days ago
Seems like a pull request for vLLM was just approved a few minutes ago:

https://github.com/vllm-project/vllm/pull/41745

("Add Gemma4 MTP speculative decoding support")