Hacker News new | ask | show | jobs
by sathish316 96 days ago
If you intercept what an agent (client) sends to the LLM with multiple MCP servers and tools, the context or header is filled with available MCP servers and all tools as part of the conversation.

With a CLI, you avoid sending this context to the LLM and it progressively discovers only what is needed.

The input token costs come down because of using a CLI instead of MCP