|
|
|
|
|
by ilaksh
90 days ago
|
|
These little 5.4 ones are relatively low latency and fast which is what I need for voice applications. But can't quite follow instructions well enough for my task. That's really the story of my life. Trying to find a smart model with low latency. Qwen 3.5 9b is almost smart enough and I assume I can run it on a 5090 with very low latency. Almost. So I am thinking I will fine tune it for my application a little. |
|