by ag2718 15 days ago
You're correct that this work is not very applicable to LLMs and that the focus here is primarily on latency.