| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ritwikgupta 133 days ago
	VLMs are selectively blind — they decide how much to look at an image based on question framing (open-ended vs Yes/No vs MCQ), even when the same visual reasoning is required.