| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by atlex2 103 days ago

A couple drawbacks so far via our scenario-based tests:

1. You can't ask the model to "think hard" about something anymore - model decides 2. Reasoning traces are no longer true to the thinking – vs opus 4.6, they really are summaries now 3. Reasoning is no longer consciously visible to the agent

They claim the personality is less warm, but I haven't experienced that yet with the prompts we have – seems just as warm, just disconnected from its own thought processes. Would be great for our application if they could improve on the above!