Y
Hacker News
new
|
ask
|
show
|
jobs
by
mightytravels
924 days ago
From what I can see it is parallel batch processing - default for that repo is 24. You can reduce batches and if you use 1 it's as fast or slow as Whisper. Quality is the exact same (same large model used).