Show HN: In-Context Index for In-Context Retrieval

Y	Hacker News new \| ask \| show \| jobs

Show HN: In-Context Index for In-Context Retrieval (github.com)

5 points by mingtianzhang 258 days ago

RAG pipelines have become bloated: embeddings, vector DBs, rerankers, and ad-hoc pipelines everywhere.

Projects like Claude Code showed a simpler path: In-Context Retrieval — letting the LLM reason directly over context for retrieval instead of outsourcing search to external infrastructure.

PageIndex takes that one step further with In-Context Indexing.

If retrieval happens in-context, the index should live there too.

Each document is transformed into a hierarchical, human-readable tree structure (like a table-of-contents tree index) inside the model's context window.

The LLM reads the structure, identifies relevant branches, opens them, and reasons through for retrieval — no embeddings, no chunking, no opaque vector indexes the model can't interpret.

Retrieval and indexing, both inside the model.