Hacker News new | ask | show | jobs
by aiordienow 54 days ago
Context engineering is where the real leverage is right now. Most people focus on model selection but the retrieval and memory layer around the model makes a bigger difference in practice. What's your approach to managing context window limits — chunking with overlap, or some kind of relevance scoring before injection?
1 comments

As i said in the article, i have a filter for retrieval. I dont elaborate because i want to make it simple to read. You have the good structure, filtering, score relevance for every memories and indexes to facilitate the search ! You can check Supermemory infra its a bit how that works behind on chaaaaa.com