Hacker News new | ask | show | jobs
by gatienboquet 446 days ago
Isn't "thinking" in image mode basically what chatgpt 4o image generation do ?
1 comments

Not at all. GPT-4o is image output - this model (and previous Qwen release QvQ - https://simonwillison.net/2024/Dec/24/qvq/) are image input only with a "reasoning" chain of thought to help analyze the images.