Hacker News new | ask | show | jobs
CLLMs: LLMs can be taught to parallel decode with up to 3.5x speedup (twitter.com)
3 points by snyhlxde 770 days ago