by wmertens 55 days ago
Wow awesome!

BTW I see that DeepSeek V4 Pro is trounced by its little Flash brother? Any ideas as to why?

1 comment

In our testing, the Pro version underperformed because it struggled on agentic tasks, where we use a harness with custom tools it was not trained on. The failures were mostly formatting issues. All of the other major releases in April, including the Flash version, had no problem adapting to custom tools. We do plan to keep adding Pro samples to see whether infrastructure degradation was also a factor.
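
For anyone wondering what a "formatting issue" with a custom tool can look like, here is a minimal sketch. It is not the harness described above: the tool names, the JSON call format, and the `parse_tool_call` helper are all hypothetical, chosen only to show how a strict harness might reject a call whose intent is otherwise correct.

```python
import json

# Hypothetical custom tools: tool name -> required argument names and types.
# These are illustrative only, not the tools used in the benchmark.
TOOLS = {
    "read_file": {"path": str},
    "run_tests": {"target": str, "timeout_s": int},
}


def parse_tool_call(raw: str):
    """Validate a model's raw output as a tool call.

    Returns (call, None) when the output is well formed,
    or (None, reason) when the harness would reject it.
    """
    try:
        call = json.loads(raw)
    except json.JSONDecodeError as exc:
        return None, f"not valid JSON: {exc.msg}"
    if not isinstance(call, dict):
        return None, "top-level value is not an object"

    name = call.get("tool")
    if not isinstance(name, str) or name not in TOOLS:
        return None, f"unknown tool: {name!r}"

    args = call.get("args")
    if not isinstance(args, dict):
        return None, "missing or non-object 'args'"

    # Check that every required argument is present with the expected type.
    for key, expected in TOOLS[name].items():
        if key not in args:
            return None, f"missing argument: {key}"
        if not isinstance(args[key], expected):
            return None, f"argument {key} should be {expected.__name__}"
    return call, None


if __name__ == "__main__":
    samples = [
        '{"tool": "read_file", "args": {"path": "a.txt"}}',
        'read_file(path="a.txt")',  # right intent, wrong call syntax
        '{"tool": "run_tests", "args": {"target": "unit", "timeout_s": "30"}}',
    ]
    for raw in samples:
        call, error = parse_tool_call(raw)
        print("ok" if error is None else f"rejected ({error})")
```

In this sketch the second and third samples name a real tool with sensible arguments but still fail, one on call syntax and one on an argument type. A model that keeps emitting the format it saw in training would lose tasks this way even when its reasoning is sound.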