Hacker News
by vardump 74 days ago
Is it possible to accomplish tagging with local AI instead of Gemini?
1 comment

As far as I've seen, local OSS video understanding models just really aren't there yet. I briefly looked at facial recognition models but a good amount of signal was actually in the video's audio instead of the raw video frames. Depends on the accuracy you're looking for at the end of the day.
Thanks for the reply. Let's hope local models catch up.