Hacker News new | ask | show | jobs
by outlore 607 days ago
you can stream the response in chunks of size N + K overlap and run the guardrails on each chunk.