| > Good prompts may actually have a moat - a complex agent system is basically just a lot of prompts. The second part of that statement (is wrong and) negates the first. Prompts aren’t a science. There’s no rationale behind them. They’re tricks and quirks that people find in current models to increase some success metric those people came up with. They may not work from one model to the next.
They don’t vary that much from one another.
They, in all honesty, are not at all difficult or require any real skill to make. (I’ve worked at 2 AI startups and have seen the Apple prompts, aider prompts, and continue prompts)
Just trial and error and an understanding of the English language. Moreover, a complex agent system is much more than prompts (the last AI startup and the current one I work at are both complex agent systems). Machinery needs to be built, deployed, and maintained for agents to work. That may be a set of services for handling all the different messaging channels or it may be a single simple server that daisy chains prompts. Those systems are a moat as much as any software is. Prompts are not. |
If one spends a lot of time building an application to achieve an actual goal they'll realize the prompts make a gigantic difference and it takes an enormous amount of fiddly, annoying work to improve. I do this (and I built an agent system, which was more straightforward to do...) in financial markets. It so much so that people build systems just to be able to iterate on prompts (https://www.promptlayer.com/).
I may be wrong - but I'll speculate you work on infra and have never had to build a (real) application that is trying to achieve a business outcome. I expect if you did, you'd know how much (non sexy) work is involved on prompting that is hard to replicate.
Hell, papers get published that are just about prompting!
https://arxiv.org/abs/2201.11903
This line of thought effectively led to Gpt-4-o1. Good prompts -> good output -> good training data -> good model.