by broalkvam 56 days ago
There are many models that extract context from real-time video. But they often just rely on identifying simple objects and tasks, not longer context spanning a whole video.