Hacker News new | ask | show | jobs
by P-MATRIX 94 days ago
Same trajectory here. The skepticism fades fast once you see it handle a real refactor across multiple files. The part that still bugs me is there's no good way to measure when the agent starts drifting — it just silently gets worse mid-session and you don't notice until you're debugging its output.