Hacker News new | ask | show | jobs
by whalesalad 923 days ago
I'm dying.
1 comments

"Do you know ffmpeg supports OCR? I haven't found the command yet, but it does support it."

This is probably 80% of my experience with ffmpeg, to be honest, but the other 20% is invaluable enough anyway.

I thought this was a joke, but it really exists! https://ffmpeg.org/ffmpeg-filters.html#ocr
Lol what. I have a bot that processes ~50 videos a day, burning in translated whisper-generated subtitles. It also translates images using Tesseract, then overlaying texts in-place. I once thought of exporting frames as images to maybe do this for video too, I actually did not even start to think FFMPEG would have tesseract support on top of everything.

Later on though I've realized the quality of tesseract's OCR on arbitrary media is often quite bad. Google translates detection and replacement is so much ahead my current image system I'd think I would just somehow reutilize that for my app, either thru public API or browser emulation ...

I'm glad we've reached a point where there is quality parody content online for our industry.
Check out Krazam. I quote their Microservices video on a regular basis (https://youtu.be/y8OnoxKotPQ)
THIS!!! It was so refreshing!
That was one of the funniest things I've seen in a while!!!! I had to stop drinking my decaf for fear of spitting it all over my computer I was laughing out loud so much!

(ps: and no, it's not Rick Astley/Never Gonna Give You Up)

The artfully inserted, corrupted predicted frames was :chefskiss: