Hacker News new | ask | show | jobs
by hwoolery 820 days ago
You can look at MediaPipe's BlazeFace / BlazePose for faster inference for lower-end chips. I currently have an application that does realtime video with YoloV8-M on my Macbook at over 60fps
1 comments

Thanks I will check if BlazePose fits my purpose. Maybe I have to train my own model. YOLOv8 is great, but it's weakness seems to be detecting people from an high camera angle, which we have to use to get all the people in the space at once and for practical reasons (cannot mount camera where it can be reached without ladder).

So I might have to annotate my own data and train my own model. I found one top view dataset by ZHDK Zürich and I got my own test footage.

I will figure out how well I can make it run on my old laptop (has a Nvidia GPU at least).