Hacker News new | ask | show | jobs
by lrvick 38 days ago
The qwen models not only have good OCR, they will describe pictures to you.
2 comments

They not not only describe pictures. They can analyze pictures. Detect anomalies. Create 3d models out of it.
Anyone wanna do a quick offline MVP on a general vision assistant for the blind? We've had things like Google Lens for a while, but it's a bit vision and touchscreen-centric.