It automates video fetching and uses Whisper to generate .srt, .vtt, and .txt files.
[1] https://github.com/zackees/transcribe-anything