by voiper1 1961 days ago
You should be able to use ffmpeg[0] to extract one frame per second or per keyframe (it's doubtful that processing every single frame is worth it) and then pass each frame to Tesseract.
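
A rough sketch of that pipeline, assuming the `ffmpeg` and `tesseract` binaries are both on your PATH. The helper names (`ffmpeg_frame_cmd`, `tesseract_cmd`, `ocr_video`) and the `frame_%05d.png` naming pattern are made up for illustration. It uses ffmpeg's `fps` filter for the one-frame-per-second case and leaves keyframe selection out.

```python
import subprocess
from pathlib import Path


def ffmpeg_frame_cmd(video, out_dir, fps=1):
    """Build an ffmpeg command that writes `fps` frames per second as PNGs."""
    # frame_%05d.png is an arbitrary naming pattern chosen for this sketch.
    pattern = str(Path(out_dir) / "frame_%05d.png")
    return ["ffmpeg", "-i", str(video), "-vf", f"fps={fps}", pattern]


def tesseract_cmd(image):
    """Build a tesseract command that prints the recognized text to stdout."""
    return ["tesseract", str(image), "stdout"]


def ocr_video(video, out_dir, fps=1):
    """Extract frames with ffmpeg, OCR each one, return {frame name: text}."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(ffmpeg_frame_cmd(video, out_dir, fps), check=True)

    results = {}
    for frame in sorted(Path(out_dir).glob("frame_*.png")):
        done = subprocess.run(
            tesseract_cmd(frame), check=True, capture_output=True, text=True
        )
        results[frame.name] = done.stdout.strip()
    return results
```

Something like `ocr_video("talk.mp4", "frames")` would then give you one text blob per extracted frame. Deduplicating near-identical frames is left out here.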

For speech-to-text, if the audio is English, try Mozilla's DeepSpeech? https://github.com/mozilla/DeepSpeech

Might be fun to try.

[0] https://stackoverflow.com/questions/27568254/how-to-extract-...

1 comment

Yup, was planning to use ffmpeg (or, more likely, OpenCV), and a subset of the frames.
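
For the OpenCV route, sampling a subset of frames might look roughly like the sketch below. It assumes the `opencv-python` package is installed. The function names are hypothetical, and the 30 fps fallback is just a guess for files that don't report a frame rate.

```python
def sample_indices(total_frames, fps, every_seconds=1.0):
    """Return the frame indices to keep: roughly one per `every_seconds`."""
    step = max(1, round(fps * every_seconds))
    return list(range(0, total_frames, step))


def read_sampled_frames(path, every_seconds=1.0):
    """Yield (frame index, image array) for the sampled frames of a video."""
    import cv2  # imported lazily so sample_indices works without OpenCV

    cap = cv2.VideoCapture(path)
    try:
        # Assumed fallback of 30 fps when the container reports 0.
        fps = cap.get(cv2.CAP_PROP_FPS) or 30.0
        total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
        for idx in sample_indices(total, fps, every_seconds):
            cap.set(cv2.CAP_PROP_POS_FRAMES, idx)  # seek to the frame
            ok, frame = cap.read()
            if not ok:
                break
            yield idx, frame
    finally:
        cap.release()
```

Each yielded frame is a NumPy array that can be handed to an OCR step. Seeking with `CAP_PROP_POS_FRAMES` can be slow or imprecise on some codecs, so reading sequentially and skipping frames is a reasonable alternative.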

Thanks so much for the tip on DeepSpeech!