Hacker News
by
cma
40 days ago
I think it also gets used in the /fast modes the providers sell at higher cost.
gunalx
40 days ago
They probably use it on all models. Fast is probably just a resource pool with less congestion, so each user gets higher throughput, but the hardware is used less efficiently.
cma
39 days ago
If it speeds up prefill too, I guess so.