|
|
|
|
|
by cocoflunchy
3 days ago
|
|
I may have misremembered but I thought I had read somewhere that recent models by OpenAI and Anthropic tend to produce reasoning that is not always understandable for humans. But you're right that it's not the case for Deepseek so maybe I'm hallucinating ;) Or maybe it was an article or a tweet about researchers trying really hard to steer the model to think in English otherwise interpretability / safety becomes a lot harder? |
|