Hacker News new | ask | show | jobs
by spiderfarmer 1002 days ago
I could use this for my project but most of my videos don't have any dialogue or voice overs. It would be perfect if it described the actual (visual) video content.
1 comments

For now it transcribes the audio of the video using Whisper.cpp; but what you say is a good feature that I will be reviewing.