| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ani17 102 days ago
	Author here. I wanted to understand what vLLM and llama.cpp are actually doing under the hood, but the codebases are massive. So I wrote a stripped down version from scratch to see the core ideas without the production complexity. Code: https://github.com/Anirudh171202/WhiteLotus