Hacker News new | ask | show | jobs
by Gigachad 496 days ago
They have already streamed the first part of the response before the filtered phrase has even been generated.
1 comments

Could you stream the raw tokens into a server side filter which then streams censored tokens at near real time?