Hacker News new | ask | show | jobs
by thylacine222 847 days ago
Whisper can also generate timings at the word level, which you could use to make better-timed subtitles
1 comments

Yes. But Whisper's word-level timings are actually quite inaccurate out of the box. There are some Python libraries that mitigate that. I tested several of them. whisper-timestamped seems to be the best one. [0]

[0] https://github.com/linto-ai/whisper-timestamped