Hacker News new | ask | show | jobs
by allemagne 456 days ago
> Expecting our users to write a pitch-perfect prompt is like asking the average person to control their computer via the command line.

I admit I lean more on the "command line pilled" side of things, but to me this analogy implies that the chat interface, specifically prompting, are fundamentally intractable as an interface to AI.

Every operating system is necessarily built on compiled/interpreted text, and therefore has the ability to "fall back" to a command line interface to have finer grained control over basically everything.

Even mobile devices, completely 100% GUI, where native terminal emulators are unusable (except for the truly dedicated) are wired this way as soon as you scratch the surface of app development.

I think this analogy has to hold for AI, no? The entire current ChatGPT renaissance is built on the concept of prompting. There are no examples of breaking the mold in this article that don't rely on an underlying "chat" paradigm. Voice agents? Just turning a vocal interface (something predating GPT3) into... prompts for an LLM. Gemini summarizing documents? A button that has a prompt pre-written and ready to go.

Carrying forward the analogy, there will be a "GUI" where your AI agent will provide a streamlined experience that is extremely powerful in matching your context most of the time, but then if you want to tweak anything yourself one of the most powerful ways to do this is to get down into the "command line" of what chat interfaces control what or how these underlying prompts are structured/ordered.