| Year ago with enough prompting it could write a function for me that did something conceptually simple but tedious. I tried Windsurf over the last 3 days. It can write complex game mechanics logic in multiple files and refactor code when told to. I can talk to it using language of the domain I just made up and I can talk to it using language that developers use when they talk about the code. It can solve bugs that would take me half an hour to figure out at least. It went from a poor junior to clever junior and half a senior in one year. It's not great at one shot architecting yet, but when told how to architect a thing it can move stuff around to conform with the desired architecture. It creates and modifies SVG's, understands colors and directions. It does all of that blind, without running the code. All it has is compiler (language server really) and linter. You mention it's an autocomplete. I barely used auto-complete at all. I just told it what to do. I touched the code a little bit only because I wanted to, never because I had to. When I wanted a change to the code, 95% of the time I just told it what change I needed and it made it. It basically replaces about 1/3 of software team already enabling (and pushing me towards) a role of (tech lead/architect/qa/product owner) and this fraction is rising. It's not flawless. Sometimes it makes things that don't work on the first try. But you don't always need to revert. It's capable of fixing what it made when told about how the desired behavior differs. And all that without running the program. I anticipate that further development is going to be letting it run and observe what it built so it's gonna move towards QA role. And the other thing might be slowly learning to differentiate between well architected code and badly architected code but that's probably harder as it requires careful preparation (and creation) of training data. While moving towards QA just requires figuring out how to let it run and inspect stuff. Maybe use the debugger. |
I'm doing everything you're doing and probably more with Claude code (I 'let it run'). I'd say I too have been through a few of the different phases of the gartner hype cycle now.
As I said it's a good tool, it's far from being an "auto-pilot".