Y
Hacker News
new
|
ask
|
show
|
jobs
by
busfahrer
27 days ago
Great, thanks! :-) and to mirror another poster: what kind of prompt parsing (prefill) speed do you get for that model? Also how is the speed for the 27B model?
1 comments
egorfine
26 days ago
35B: 1300-1800 t/s on both Q4 and Q6.
27B: give me 20 minutes
link
busfahrer
26 days ago
Thank you, good sir!
link
27B: give me 20 minutes