Hacker News new | ask | show | jobs
by gblargg 19 days ago
BTW there's a summary today that has bullet-point markup and it just wraps around without proper formatting. Title is "Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA".