by mcharytoniuk 747 days ago
Yes, exactly. You can split the available context into "slots" (chunks) so it can handle multiple requests concurrently. The number of slots is configurable.
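To make the arithmetic concrete, here is a rough sketch of the idea, assuming the context is divided evenly between slots and each request holds one slot until it finishes. The names (`Slot`, `split_context`, `acquire`) are made up for illustration and are not taken from any real server's API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Slot:
    slot_id: int
    capacity: int       # tokens of context this slot may use
    busy: bool = False  # True while a request occupies the slot


def split_context(total_context: int, n_slots: int) -> list[Slot]:
    """Divide the total context evenly into n_slots chunks (assumed even split)."""
    if n_slots < 1:
        raise ValueError("n_slots must be at least 1")
    per_slot = total_context // n_slots
    return [Slot(slot_id=i, capacity=per_slot) for i in range(n_slots)]


def acquire(slots: list[Slot]) -> Optional[Slot]:
    """Return the first free slot and mark it busy, or None if all are taken."""
    for slot in slots:
        if not slot.busy:
            slot.busy = True
            return slot
    return None


# Hypothetical numbers: an 8192-token context split into 4 slots.
slots = split_context(8192, 4)
```

The trade-off is that more slots means more concurrent requests, but each request gets a smaller share of the context.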