| HN Mirror



	by sethcronin 97 days ago
	I guess I'm skeptical that this actually improves performance. I'm worried that a middleman sitting on the tool outputs can strip useful context that the agent actually needs to diagnose a problem.

2 comments

ivzak 97 days ago

You’re right - poor compression can cause that. But skipping compression altogether is also risky: once context gets too large, models can fail to use it properly even if the needed information is there. So the way to go is to compress without stripping useful context, and that’s what we are doing.


backscratches 97 days ago

Edit your LLM-generated comment, or at least make it output in a less annoying LLM tone. It wastes our time.


thebeas 97 days ago

That's why we give the model the chance to call expand() in case it needs more context. We know it's counterintuitive, so we will add benchmarks to the repo soon.

From what we've observed, the effect on performance depends on the task and the model itself, and it is most visible on long-running tasks.


fcarraldo 97 days ago

How does the model know it needs more context?


thebeas 97 days ago

We provide the model with a tool we call expand(), which lets the model retrieve more context when it needs it.

We state this in a note appended directly to the outputs, so the model knows exactly where the lines were removed from.
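
To make the mechanism described above concrete, here is a minimal sketch in Python. It is not the project's actual implementation: the class name `CompressedOutputStore`, the `keep_head`/`keep_tail` parameters, the marker wording, and the `expand(output_id, first, last)` signature are all assumptions made for illustration. It only shows the general idea of keeping the full output on the side and leaving a note where lines were removed.

```python
class CompressedOutputStore:
    """Hypothetical helper: truncates tool outputs but keeps the originals."""

    def __init__(self, keep_head: int = 3, keep_tail: int = 3) -> None:
        self.keep_head = keep_head
        self.keep_tail = keep_tail
        self._originals = {}  # output_id -> list of original lines

    def compress(self, output: str) -> str:
        lines = output.splitlines()
        if len(lines) <= self.keep_head + self.keep_tail:
            return output  # short enough, nothing is removed

        output_id = len(self._originals)
        self._originals[output_id] = lines

        start = self.keep_head             # index of first removed line
        end = len(lines) - self.keep_tail  # index just past last removed line

        # The marker sits exactly where the lines were removed and tells the
        # model how to get them back (1-based, inclusive line numbers).
        marker = (
            f"[{end - start} lines removed (lines {start + 1}-{end}); "
            f"call expand({output_id}, {start + 1}, {end}) to see them]"
        )
        return "\n".join(lines[:start] + [marker] + lines[end:])

    def expand(self, output_id: int, first: int, last: int) -> str:
        """Return original lines first..last (1-based, inclusive)."""
        lines = self._originals[output_id]
        return "\n".join(lines[first - 1:last])


store = CompressedOutputStore(keep_head=2, keep_tail=1)
raw = "\n".join(f"log line {i}" for i in range(1, 11))
print(store.compress(raw))     # what the model would see
print(store.expand(0, 3, 9))   # what an expand() call would return
```

In this sketch the model never has to guess that something is missing: the marker is part of the tool output itself, so the decision to call expand() rests on visible evidence rather than on the tool description alone. Whether a model reliably acts on such a marker is the open question raised in this thread.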


kingo55 97 days ago

Presumably in much the same way it knows it needs to use tool calls to reach its objective.


Zetaphor 96 days ago

I'd argue not: with tool calls, the model has a description of what each tool can be used for available to it at all times. There's plenty of intermediate but still important information that could be compacted away, and unless there is a logical reason to go looking for it, the model doesn't know what it doesn't know.
