|
|
|
|
|
by fxj
442 days ago
|
|
https://ollama.com/joefamous/QVQ-72B-Preview Experimental research model with enhanced visual reasoning capabilities. Supports context length of 128k. Currently, the model only supports single-round dialogues and image outputs. It does not support video inputs. Should be capable of images up to 12 MP. |
|
That's an earlier version released some months ago. They even acknowledge it.
The version they present in the blog post and you can run in their chat platform is not open or available to download.