by atlex2
56 days ago
A couple drawbacks so far via our scenario-based tests:

1. You can't ask the model to "think hard" about something anymore - model decides
2. Reasoning traces are no longer true to the thinking – vs opus 4.6, they really are summaries now
3. Reasoning is no longer consciously visible to the agent

They claim the personality is less warm, but I haven't experienced that yet with the prompts we have – seems just as warm, just disconnected from its own thought processes. Would be great for our application if they could improve on the above!