| Hi HN, I built OculOS because giving AI agents (like Claude Code or Cursor) control over desktop apps is still surprisingly difficult. Most current solutions rely on slow OCR/Vision or fragile pixel coordinates. OculOS is a lightweight daemon written in Rust that reads the OS accessibility tree and exposes every button, text field, and menu item as a structured JSON API and MCP server. Why this is different: Semantic Control: No screenshots or coordinates. The agent interacts with actual UI elements (e.g., "Click the 'Play' button"). Rust-powered: Single binary, zero dependencies, and extremely low latency. Universal: Supports Windows (UIA), macOS (AXUIElement), and Linux (AT-SPI2). Local & Private: Everything runs on your machine; no UI data is sent to the cloud. It also includes a built-in dashboard for element inspection and an automation recorder. Iām looking forward to your feedback and technical questions! |