Hacker News
by eerop 814 days ago
Thanks. I just tried it, and it's definitely faster, but it still sometimes takes >3 seconds (my app requires the completion to finish in <3 seconds).

I've tried to optimize it by reducing token length and other methods, but I'm wondering if there are any better LLMs.