Hacker News
by leoedin 825 days ago
> computer vision has been improving, yes, but nothing earth shattering in the last 5 years

I totally and completely disagree. Sure, "computer vision" industrial cameras doing edge detection haven't changed much, but the computer vision my phone can do is many orders of magnitude better today than it was 5 years ago.

There are tools now that can take a short video of your bookcase and identify every book. That's serious progress!

Edit: This is the example I was referencing: https://simonwillison.net/2024/Feb/21/gemini-pro-video/

It breaks video down into tokens for a large language model and asks for structured data out. That's groundbreaking compared to any non-LLM style of machine vision.

1 comment

I agree there is cool stuff going on in vision, absolutely. But I wasn’t talking about the field in general.

I just don’t think it moves the needle significantly in this particular area. For example, structured data out of a single camera is way better than it was 5+ years ago, but it isn’t as good as a dedicated multi-sensor setup (i.e. state of the art for robotics), and that in turn isn’t good enough for the problems in the GP post, which was the point.