Hacker News new | ask | show | jobs
Coalescence: Making LLM inference 5x faster (blog.dottxt.co)
6 points by Homunculiheaded 871 days ago