Hacker News new | ask | show | jobs
by wastemaster 105 days ago
We deployed AI agents across 25 hospitality properties and logged ~46,000 guest conversations. The main failure mode wasn’t tone or retrieval. It was “confident gap-filling”: the model promising operational outcomes nobody had verified. This post is about the production failures we saw and the constraints we added to stop them.