|
|
|
|
|
by lolinder
616 days ago
|
|
This doesn't solve the critical problem, which is that you usually can't tell if something is okay until you have context that you don't yet have. This is why even SOTA models will backtrack when you hit the filter—they only realize you're treading into banned territory after a bunch of text has already been generated, including text that already breaks the rules. This is hard to fix because if you don't wait until you have enough context, you've given your censor a hair trigger. > Combine both streams with SSE on the front end and don't render the content stream result until the validation stream returns "OK". Just a note that this particular implementation has the additional problem of not actually applying your validation stream at the API level, which means your service can and will be abused worse than it would be if you combined the streams server-side. You should never rely on client-side validation for security or legal compliance. |
|
For most consumer use cases, it probably doesn't matter if a few tokens leak before the about, especially if they're not rendered.
Tune it to your needs :)