|
|
|
|
|
by deaux
69 days ago
|
|
> If you just do a tiny amount of tok/day and can wait for the answer to be computed overnight or so But they can't? The usage pattern is the polar opposite. Most people running these models locally just ask a few questions to it throughout the day. They want the answers now, or at least within a minute. |
|