by sigmar 931 days ago
And Whisper large is 1.55B parameters at 16 bits per weight rather than 4 bits, I believe, so the nano-1 weights are ~1/3rd the size. Really impressive if these benchmarks are representative of real-world performance.
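A rough sketch of the arithmetic behind that ~1/3rd figure. The comment doesn't give nano-1's parameter count; the 1.8B used here is an assumption (the figure commonly reported for Gemini Nano-1), and the helper name `weight_bytes` is just for illustration. It counts raw weight storage only and ignores quantization scales, activations, and file overhead.

```python
def weight_bytes(params: float, bits_per_weight: int) -> float:
    """Raw storage for the weights alone, in bytes."""
    return params * bits_per_weight / 8


WHISPER_LARGE_PARAMS = 1.55e9  # from the comment above, stored at 16 bits
NANO_1_PARAMS = 1.8e9          # ASSUMPTION: not stated in the comment, stored at 4 bits

whisper_gb = weight_bytes(WHISPER_LARGE_PARAMS, 16) / 1e9
nano_gb = weight_bytes(NANO_1_PARAMS, 4) / 1e9
ratio = nano_gb / whisper_gb

print(f"whisper large: {whisper_gb:.2f} GB")
print(f"nano-1:        {nano_gb:.2f} GB")
print(f"ratio:         {ratio:.2f}")
```

Under those assumptions that works out to roughly 3.1 GB vs 0.9 GB, a ratio of about 0.29, which lines up with "~1/3rd the size" even though nano-1 would have slightly more parameters.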