| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by simonw 355 days ago
	There are some spectacular local models for generating text descriptions of images now. I suggest starting with Mistral Small 3.2, Gemma 3 and Qwen 2.5VL - all available via Ollama. I expect we will see a Qwen 3VL soon.