Hacker News new | ask | show | jobs
by EnPissant 86 days ago
That assumes you have significant work to do between fetches (so you can prefetch while using the current data). With LLM decode you don't.