Hacker News
by nozzlegear 8 days ago
AI can barely figure out how to make a cartoon pelican ride a bicycle.
2 comments

Generating SVG code and generating an image are two different things.
What would the LLM generate more accurately: an SVG of a pelican on a bike, or an SVG of a gory, dead woman?

The medium is superfluous.

> The medium is superfluous.

No, it's not. Generating an SVG means asking the model to write text that can then be rendered as an image.
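To make the "text first, image later" point concrete, here is a minimal sketch. The SVG string is a hand-written, hypothetical stand-in for model output (two wheels and a frame, not a real pelican), and it is only parsed with Python's standard library, not rendered.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for what a model might emit: plain text, not pixels.
svg_text = """<svg xmlns="http://www.w3.org/2000/svg" width="200" height="120" viewBox="0 0 200 120">
  <circle cx="50" cy="90" r="25" fill="none" stroke="black"/>
  <circle cx="150" cy="90" r="25" fill="none" stroke="black"/>
  <path d="M50 90 L100 50 L150 90" fill="none" stroke="black"/>
</svg>"""

# Nothing is drawn here; the markup only becomes an image once a renderer interprets it.
root = ET.fromstring(svg_text)
SVG_NS = "{http://www.w3.org/2000/svg}"

# Count the <circle> elements (the "wheels") directly under the root.
wheels = root.findall(f"{SVG_NS}circle")
print(type(svg_text).__name__, len(wheels))
```

The model never sees the rendered result in this setup; it has to get the coordinates right purely by writing them out.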

Generating an image directly skips that intermediate step and outputs the image itself, so the accuracy is in a completely different league. The models people ask for SVGs typically cannot do this; only specially post-trained variants can.

(An LLM will do the SVG of a pelican on a bike much more accurately, btw.)

AI does fine at that. LLMs have trouble generating SVGs of that, but that's an intentionally obtuse test.