|
|
|
|
|
by prymitive
1 hour ago
|
|
For me the biggest issue with AI coding model is not that it writes code, or how it writes code, or whether it will replace me or not. I mean, these are real problems, but just not the ones that makes the difference to me daily. Instead what sets my feeling about the AI is simple: the experience of working with AI models, because it's the worse experience of my life so far. Every single LLM will make up stuff, go down rabbit holes I didn't even ask it to visit and infuriate me all day long by doing ALMOST what I asked it to do, but just not quite. Take "yes" to one question as an approval to some other question. Come up with a plan but one you review the plan and accept it it hits some minor issue and then throws away the plan and do whatever it wants. Or when I reject some line of code or don't approve a command it wants to run 9/10 times it just tries to re-add or re-run it as if my rejection was meaningless. And no amount of rules and markdown files ever seems to change its behaviour for long, if at all.
It's like working with a sociopath who just doesn't remember anything that happened 5 minutes ago.
It's the classic case of "my computer doesn't do what I want, only what I tell it". And all of that would be fine if it didn't pretend to be a human, if all the UX didn't create the illusion that you interact with some intelligent being, because as long as I remember that this is a cli tool it's all good, I manage my expectations, but the experience sooner or later makes you annoyed and frustrated - and if this was a person they would either stop or you wouldn't never work with them again. |
|
This is my exact experience. LLMs get 95% of the way to my personal quality bar, and for simple tasks 100% of the way.
More complex things, no. This means that, in practice, I end up having to understand the code pretty much as much deeply as I would anyway, without agents.
So, essentially, my experience is that it's automated away the easy bits, but left the hard bits, so all of my time is spent doing the hard bits, which is mentally exhausting.
Maybe Fable would be good enough to get to 100% of my quality bar on more complex tasks, but I never got chance to try it.