| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jameswhitford 3 hours ago
	Hi, author here, I cannot give an exact number for how many token the verification step took, but the verification GLM 5.2 ran was very stupid and definitely a waste of time. It read the pixel color data to try and verify the scene rendered properly. Which is really bad. Opus opened the game in a Playwright browser and took screenshots to verify the actual image. Which helped a lot. Pro tip: You could use a multi-modal model to verify images as a subagent spawned by GLM 5.2, to get around this issue.