Y
Hacker News
new
|
ask
|
show
|
jobs
by
comova
391 days ago
I believe this is to improve performance by shortening the context window for long thinking processes. I don't think this is referring to real-time summarizing for the users' sake.
2 comments
usaar333
391 days ago
When you do a chat are reasoning traces for prior model outputs in the LLM context?
link
int_19h
391 days ago
No, they are normally stripped out.
link
j_maffe
391 days ago
> I don't think this is referring to real-time summarizing for the users' sake.
That's exactly what it's referring to.
link