Hacker News new | ask | show | jobs
by raw_anon_1111 82 days ago
For the customer service scenario, that’s completely impractical. The latency would be horrible. In my experience, I have to use the simplest fastest model I have available (in my case Nova Lite) to get quick responses.