Hacker News new | ask | show | jobs
by yu3zhou4 60 days ago
An open course on building high performance LLM inference engine! Hope to finish by the end of April

https://github.com/jmaczan/tiny-vllm