Hacker News new | ask | show | jobs
by jiehong 48 days ago
Using it in Kagi Assistant is stupidly slow. I get like 10 t/s.

While it’s pretty fast in the official app for example.

Kagi Assistant is also kind of broken when using Qwen 3.6 Plus.

So, beware of using them in Kagi at the moment.

1 comments

Probably a provider thing. Looking at https://help.kagi.com/kagi/ai/llms-privacy.html, they're using deepinfra.

Looking at https://openrouter.ai/deepseek/deepseek-v4-flash/providers tells us that the deepseek provider achieves 49tps of throughput while deepinfra 19tps.

Thanks for taking the time to provide this info. I appreciate it