Hacker News new | ask | show | jobs
by the_king 132 days ago
I just tested it on a very difficult Raven matrix, that the old version of DeepThink, as well as GPT 5.2 Pro, Claude Opus 4.6, and pretty much every other model failed at.

This version of DeepSeek got it first try. Thinking time was 2 or 3 minutes.

The visual reasoning of this class of Gemini models is incredibly impressive.

1 comments

Deep Think not DeepSeek