by epolanski
5 hours ago
To me DS 4 is still the most interesting because of its much lower cost. Also, DS 4 training isn't done yet. In my personal Opus vs DS 4 Pro benchmarks, 16 different real-life work tasks, DS 4 performed as well as Opus 4.8 high overall, but with a few drawbacks:

- On one of the 16 tasks, it needed several prompts to be steered back on topic.
- Its review capabilities seem much worse.
- DS 4 had the cleanly better solution in 3 cases out of 16, with Opus "only" doing cleanly better 2 times out of 16.

Still, I want to emphasize that it is the worst-case scenarios that imho matter the most, not the best ones, and on that front Opus outperformed.

That being said, I spent less than $2 on the API over 4 days of work, which is more or less what I would've spent with Anthropic APIs for less than one task.