Hacker News new | ask | show | jobs
by gertlabs 57 days ago
This is the most underrated release we tested at https://gertlabs.com

I'm surprised they open sourced it. It's very comparable with Kimi K2.6 performance-wise, and slightly better with tools. And it's cheaper.

1 comments

Wow awesome!

BTW I see that deepseek V4 pro is trounced by it's little flash brother? Any ideas as to why?

In our testing, the Pro version underperformed because it struggled in agentic tasks where we use a harness with custom tools it was not trained on. Mostly formatting issues. All of the other major releases in April, including the Flash version, have no problem adapting to custom tools. We do plan to continue adding Pro samples to see if there was an infrastructure degradation component.