Hacker News new | ask | show | jobs
by IxInfra 77 days ago
This is really interesting, especially the chunking and parallel Haiku approach.

Curious how it holds up as note volume grows. At some point you're still doing N relevance checks per tool call. do you hit a scaling limit there or does chaching keep it manageable? Also wondering if you've seen any drift in relevance when notes become more numerous or overlapping