|
|
|
|
|
by cyp0633
325 days ago
|
|
That is not the case here - I never encountered this with whisper-large-v3 or similar ASR models. Part of the reason, I guess, is that those subs are burnt into the movie, which makes them hard to extract. Standalone subs need the corresponding video resource to match the audio and text. So nothing is better than YouTube videos which are already aligned. |
|
*Although it used to be more common for AVI files in the olden days.