HN Mirror


by jeffchuber 12 days ago

If you are truly bitter-lesson pilled, give the agent all the tools and let it decide which to use.

- regex (grep)
- hybrid search (bm25+vector)

This X vs. Y debate is uninteresting when the answer can be both.
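To make "give it both" concrete, here is a minimal sketch of a tool registry that exposes a grep-style regex search alongside a hybrid search. Everything in it is an assumption for illustration: the `DOCS` corpus, the function names, and the `call_tool` dispatcher are hypothetical, the "vector" side is a bag-of-words cosine standing in for real embeddings, and the two rankings are merged with reciprocal rank fusion, which is one common choice and not something the thread prescribes.

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus; a real setup would index files or chunks.
DOCS = {
    "a.md": "grep finds exact strings such as ERR_TIMEOUT in log files",
    "b.md": "vector search retrieves passages with similar meaning",
    "c.md": "bm25 ranks documents by weighted keyword overlap",
}

def tokenize(text):
    return re.findall(r"[a-z0-9_]+", text.lower())

def regex_search(pattern):
    """Exact matching, like grep: ids of docs whose text matches the pattern."""
    rx = re.compile(pattern)
    return [doc_id for doc_id, text in DOCS.items() if rx.search(text)]

def bm25_rank(query, k1=1.5, b=0.75):
    """Rank all docs by a plain BM25 score (best first)."""
    tokens = {doc_id: tokenize(text) for doc_id, text in DOCS.items()}
    avg_len = sum(len(t) for t in tokens.values()) / len(tokens)
    scores = {}
    for doc_id, toks in tokens.items():
        tf = Counter(toks)
        score = 0.0
        for term in set(tokenize(query)):
            df = sum(1 for t in tokens.values() if term in t)
            idf = math.log(1 + (len(tokens) - df + 0.5) / (df + 0.5))
            denom = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scores[doc_id] = score
    return sorted(scores, key=scores.get, reverse=True)

def vector_rank(query):
    """Stand-in for embedding search: cosine similarity over word counts."""
    q = Counter(tokenize(query))
    q_norm = math.sqrt(sum(v * v for v in q.values()))

    def cosine(text):
        d = Counter(tokenize(text))
        d_norm = math.sqrt(sum(v * v for v in d.values()))
        dot = sum(q[w] * d[w] for w in q)
        return dot / (q_norm * d_norm) if q_norm and d_norm else 0.0

    return sorted(DOCS, key=lambda doc_id: cosine(DOCS[doc_id]), reverse=True)

def hybrid_search(query, top_k=2, k=60):
    """Merge the BM25 and vector rankings with reciprocal rank fusion."""
    fused = Counter()
    for ranking in (bm25_rank(query), vector_rank(query)):
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    return [doc_id for doc_id, _ in fused.most_common(top_k)]

# Both tools are registered; the agent picks one by name per call.
TOOLS = {"regex_search": regex_search, "hybrid_search": hybrid_search}

def call_tool(name, argument):
    return TOOLS[name](argument)
```

With this shape, an agent could call `call_tool("regex_search", r"ERR_\w+")` when it knows the exact string and `call_tool("hybrid_search", "keyword overlap ranking")` when it only knows the topic.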

5 comments

pastel8739 12 days ago

That assumes the agent knows which one is better. And baking that preference in via post-training would require a study like this one to establish where each approach works well.


fnordpiglet 12 days ago

I’ve got a custom, ultra-high-performance streaming semantic search that I exposed as a tool, and the RL bias in Claude is almost insurmountable without copious and consistent steering. Codex will follow instructions and use the tools I ask it to, but for god’s sake, between Claude asking to take a nap because it’s getting late in the session and it regressing to RL-biased tools like grep, it’s maddening. When I can get it to use my compositional tools, tool calls drop from around 20-50 to 3-4, but it’s almost impossible to steer.


mediaman 12 days ago

Anthropic is, I believe, fully pursuing the idea that you shouldn't use their model with anything but their own products. They don't care whether it generalizes.

I agree it's very frustrating to use with custom tools and harnesses that can speed up the process for domain-specific purposes.


bachittle 12 days ago

Exactly this. I use a tool called qmd for the hybrid search portion. It also uses local LLMs to provide summaries of your own markdown data. My agents use both depending on what type of search they are doing, and both provide good results.

https://github.com/tobi/qmd


budududuroiu 12 days ago

Both is usually the right answer, since you can use LLMs to do query expansion and effectively increase the recall of your retrieval algorithm.
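A rough sketch of why query expansion helps recall, under stated assumptions: `expand_query` is a hypothetical stand-in for an LLM call (here just a canned lookup), `DOCS` is a made-up corpus, and `keyword_search` is a deliberately naive retriever. The point is only that searching every variant and taking the union can surface documents the original wording misses.

```python
# Hypothetical toy corpus for illustration.
DOCS = {
    "auth.md": "rotate the api token before it expires",
    "login.md": "users sign in with a password and a one-time code",
    "billing.md": "invoices are issued on the first of each month",
}

def keyword_search(query):
    """Naive retriever: ids of docs sharing at least one word with the query."""
    words = set(query.lower().split())
    return {doc_id for doc_id, text in DOCS.items() if words & set(text.split())}

def expand_query(query):
    """Stand-in for an LLM that proposes paraphrases (assumption: canned output)."""
    canned = {"credentials": ["api token", "password"]}
    return [query] + canned.get(query, [])

def search_with_expansion(query):
    """Run the retriever on every variant and return the union of the hits."""
    hits = set()
    for variant in expand_query(query):
        hits |= keyword_search(variant)
    return hits
```

In this toy setup, `keyword_search("credentials")` finds nothing, while `search_with_expansion("credentials")` returns both `auth.md` and `login.md`. The union can also pull in off-topic hits, so expansion is usually paired with a reranking step.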


dominotw 12 days ago

It will only use tools it was trained on? What's the benefit of giving it all the tools?


jeffchuber 12 days ago

then you are not agi pilled


worthless-trash 12 days ago