by cnbeining
1204 days ago
I have a pipeline set up in https://github.com/cnbeining/Whisper_Notebook/blob/master/Wh... :
- Run Voice Activity Detection for better timestamp output
- Transcribe with Whisper
- Run Forced Alignment to get per word timestamp
- Create better segmented SRT
- Translate (with multiple APIs: DeepL, Google Translate, Baidu and a couple more are implemented)
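
To illustrate the "better segmented SRT" step in the list above, here is a minimal sketch of how per-word timestamps could be regrouped into subtitle cues. It is not taken from the linked notebook: the `Word` type, the function names, and the `max_chars` / `max_gap` thresholds are all hypothetical, and the word timings are assumed to come from whatever forced-alignment step runs before it.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """One aligned word (hypothetical structure, times in seconds)."""
    text: str
    start: float
    end: float


def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segment_words(words: List[Word], max_chars: int = 42,
                  max_gap: float = 0.6) -> List[List[Word]]:
    """Group words into cues.

    A new cue starts when the pause before a word exceeds max_gap seconds,
    or when adding the word would push the cue text past max_chars.
    Both thresholds are illustrative defaults, not values from the notebook.
    """
    cues: List[List[Word]] = []
    current: List[Word] = []
    for word in words:
        if current:
            gap = word.start - current[-1].end
            length = len(" ".join(w.text for w in current)) + 1 + len(word.text)
            if gap > max_gap or length > max_chars:
                cues.append(current)
                current = []
        current.append(word)
    if current:
        cues.append(current)
    return cues


def to_srt(cues: List[List[Word]]) -> str:
    """Render cues as SRT text: index, time range, text, blank line."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        start = format_timestamp(cue[0].start)
        end = format_timestamp(cue[-1].end)
        text = " ".join(w.text for w in cue)
        blocks.append(f"{index}\n{start} --> {end}\n{text}\n")
    return "\n".join(blocks)
```

Usage would look like `print(to_srt(segment_words(words)))`, where `words` is the list of aligned words. Splitting on pauses rather than on Whisper's own segment boundaries is just one possible heuristic; splitting on punctuation is another.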