Hacker News
Show HN: Local video search with Qwen3-VL: no API, runs on Apple Silicon, GPUs (github.com)
3 points by sohamrj 75 days ago
Last week, I posted SentrySearch, a CLI for semantic video search using Gemini's embedding API. The #1 request was local model support.

Turns out Qwen3-VL-Embedding can natively embed video into the same kind of vector space, with no API and fully offline. It runs on Apple Silicon (MPS) and NVIDIA GPUs (CUDA). The 8B model needs ~18GB of RAM; on smaller machines, use the 2B model.

sentrysearch index /path --backend local
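
For a rough sense of where the memory figure comes from, here is a back-of-the-envelope sketch. It assumes nominal parameter counts (8 billion and 2 billion) and 16-bit weights, neither of which is stated in the post; the gap up to ~18GB is presumably activations and runtime overhead.

```python
def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes.

    Assumption: 2 bytes per parameter (16-bit precision). This ignores
    activations, framework overhead, and any video frame buffers.
    """
    return num_params * bytes_per_param / 1e9


# Nominal parameter counts, assumed from the "8B" and "2B" model names.
for name, params in [("8B", 8e9), ("2B", 2e9)]:
    print(f"{name}: ~{weight_memory_gb(params):.0f} GB of weights at 16-bit precision")
```

By this estimate the 8B weights alone take about 16 GB and the 2B weights about 4 GB, which is consistent with the 2B model being the one to try on smaller machines.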

Also added: a similarity threshold to suppress weak matches, and a Tesla metadata overlay that renders speed/location onto matched clips.
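
To illustrate the threshold idea, here is a minimal sketch of filtering search results by similarity. It is not SentrySearch's actual code: the use of cosine similarity, the function names, the toy 3-dimensional vectors, and the 0.5 cutoff are all assumptions made for the example.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def filter_matches(query_vec, clips, threshold=0.5):
    """Return (name, score) pairs at or above the threshold, best first.

    `clips` is a list of (name, embedding) pairs. The threshold value is
    a hypothetical default, not one taken from the tool.
    """
    scored = [(name, cosine_similarity(query_vec, vec)) for name, vec in clips]
    kept = [(name, score) for name, score in scored if score >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


# Toy embeddings standing in for real video-chunk vectors.
clips = [
    ("clip_a", [0.9, 0.1, 0.0]),
    ("clip_b", [0.0, 1.0, 0.0]),
    ("clip_c", [0.7, 0.7, 0.1]),
]
query = [1.0, 0.0, 0.0]

for name, score in filter_matches(query, clips, threshold=0.5):
    print(f"{name}: {score:.2f}")
```

Here clip_b is orthogonal to the query and gets dropped, while clip_a and clip_c survive. Without a threshold, a plain top-k search would still return clip_b even though it is not a meaningful match.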

Details are in the README.