| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by hhh 116 days ago
	Most modern models can dispatch MCP calls in their inference engine, which is how code interpreter etc work in ChatGPT. Basically an mcp server that the execution happens as a call to their ai sandbox and then returns it to the llm to continue generation. You can do this with gpt-oss using vLLM.