Hacker News new | ask | show | jobs
by rfw300 928 days ago
The main thing is chat is just one application of LLMs. Other applications are much more latency sensitive. Imagine, for instance, an LLM-powered realtime grammar checker in an editor.