Hacker News new | ask | show | jobs
by OutOfHere 613 days ago
It has a clever way to decode multiple valid tokens at once, rather than just one token at a time.

Corresponding project link: https://github.com/sgl-project/sglang