Hacker News new | ask | show | jobs
by gerash 1292 days ago
scaling a large language model to serve thousands of queries per second and be continuously updated is not trivial.

I'm sure we'll get there at some point.