|
|
|
|
|
by youssefabdelm
924 days ago
|
|
Does insanely-fast-whisper use beam size of 5 or 1? And what is the speed comparison when set to 5? Ideally it also exposes that parameter to the user. Speed comparisons seem moot when quality is sacrificed for me, I'm working with very poor audio quality so transcription quality matters. |
|
Side note, the insanely fast whisper readme gives benchmarks on an A100 but only the FA2 lines were. The rest were on a T4 looking at the notebooks/history. Turing doesn't support FA2 so the gap should be smaller with it, but based on the distil-whisper paper CTranslate2 is probably still faster.
TensorRT-LLM might be faster but I haven't looked into it yet.