|
|
|
|
|
by kcb
251 days ago
|
|
What's the benefit to running LLMs locally? Data is already remote, LLM inferencing isn't particularly constrained by Internet latency. So you get worse models, performance, and battery life. Local compute on a power constrained mobile device is required for applications that require low latency or significant data throughput and LLM inferencing is neither. |
|
At work:
That I don't rent $30,000 a month of PTUs from Microsoft. That I can put more restricted data classifications into it.
> LLM inferencing isn't particularly constrained by Internet latency
But user experience is