Hacker News
by amindiro 180 days ago
Agreed, this observation holds for both decode and prefill. Thanks for sharing the code.