Hacker News new | ask | show | jobs
by raffael_de 3 hours ago
Just tried "generate an SVG of a pelican riding a bicycle" for Claude Opus 4.8 Max and of course both legs on same side ... the smartest publicly available model by Anthropic (after Fable) doesn't even successfully simulate understanding the concept of a bicycle.
1 comments

Yet it can write code better than 99% of humans…

It’s just starting to be trained on svgs, which is a really hard problem

"99% of humans" is a low bar. Maybe you mean "99% of people who earn money by developing software"?
LLMs can't really "see", so I challenge you to draw a pelican on a bike without any visual feedback, just code. Because that is how they are doing it.

Vision tokens for transformers aren't really well solved yet, which is why they can smash a phd math problem and trip over a "count the cats on the chair" problem.