| Ok, sure. …but people are getting the mistaken impression that this is an actual system, running actual commands. I can also emulate a docker container. I’ll just write down the commands you send me and respond with some believable crap. …but no one is going to run their web server on me, because that’s stupid. I can respond hundreds of times a second and maintain the internal state required for that. Neither can this model. It’s good, and interesting, but it’s not running code, it’s predicting sentences and when you’re running software it was to be accurate, fast, consistent and have a large internal data state. Trying to run docker in gpt is fun. Trying to use docker in gpt to do work is stupid. It’s never going to work as well as actually running docker. It’s just for fun. Models that write code and the execute that code will be in every way superior to models that try to memorise the cli api of applications. It’s an almost pointless use of the technology. Gpt may have “learnt” python; that’s actually interesting! Docker is not interesting. If I want to use the docker api, I can type `docker` on my computer and use it. It's pretty sad that the thing that excites people the most about an amazing new language model is that it can do trivial command line actions, that you can do without the model. Spending millions of dollars to produce a model that can do what you can already trivially do is very seriously not what openai just did. |
Right. The thing that is impressive is that ChapGPT can do this effectively. This means that it has some "understanding" of how `pwd`, `ls`, `apt`, `docker`, etc all work. In some sense, this is an AI that knows how to read code like a human instead of like a machine.