Hacker News new | ask | show | jobs
Set Block Decoding Is a Language Model Inference Accelerator (arxiv.org)
4 points by veryluckyxyz 291 days ago