|
|
|
|
|
by eborgnia
393 days ago
|
|
Hey -- good question! We're focused on a narrower task right now that aims to save frontier tokens (both input & output). Our merge + retrieval models are simply smaller LLMs that save you from passing in too much context to Sonnet, and allow you to output fewer tokens. These are cheap for us to run while still maintaining or improving accuracy. |
|
What’s the differentiator or plan for arbitrary query matching?
Latency? If you think about it - not really a huge issue. Spend 20s-1M mapping an entire plan with Gemini for a feature.
Pass that to Claude Code.
At this point you want non-disruptive context moving forward and presumably any new findings would only be redundant with what is in long context already.
Agentic discovery is fairly powerful even without any augmentations. I think Claude Code devs abandoned early embedding architectures.