Hacker News
by yu3zhou4, 60 days ago
An open course on building a high-performance LLM inference engine! I hope to finish it by the end of April.
https://github.com/jmaczan/tiny-vllm