Hacker News new | ask | show | jobs
by mirekrusin 10 days ago
on 2x 4090:

90 t/s for 27B Q8 256k context

260 t/s for 35B-A3B Q8 256k context