Hacker News new | ask | show | jobs
by hoverbot 305 days ago
Author here. We run chatbots with a smart router: classify noise/PII first, call retrieval+LLM only when needed, and use deterministic flows for actions. This keeps latency and cost down and makes behavior explainable. What would you add or change?