by eerop
814 days ago
Thanks. I just tried it, and it's definitely faster, but it still sometimes takes >3 seconds (my app requires the completion to finish in <3 seconds). I've tried to optimize it by reducing token length and other methods, but I'm wondering if there are any better LLMs.