Y
Hacker News
new
|
ask
|
show
|
jobs
by
outlore
607 days ago
you can stream the response in chunks of size N + K overlap and run the guardrails on each chunk.