Hacker News new | ask | show | jobs
by conception 150 days ago
https://clocks.brianmoore.com

K2 is one of the only models to nail the clock face test as well. It’s a great model.

3 comments

Kimi 2 is remarkably consistently the best. I wonder if it's somehow been trained specifically on tasks like these. It seems too consistent to be coincidence

Also shocking is how the most common runner up I've seen is DeepSeek

It's better than most, but not 100%. As I see this the clock hands are all correct, but the numbers only go 1-8.
Cool comparison, but none of them get both the face and the time correct when I look at it.
Refresh. It’s not every time but k2 hits a perfect clock for me about 7/10 or so.