Hacker News new | ask | show | jobs
by bisonbear 101 days ago
curious how you measure/track how this actually impacts the coding agent?
1 comments

Fair question. I haven’t done a systematic benchmark yet, so I don’t have hard numbers to point to. Honestly I’ve mostly been iterating from actual use. The main test has been whether it helps me keep the good parts of brainstorming with the agent, recover context across longer multi PR or multi session work, and reduce friction overall. So right now the evidence is mostly qualitative and based on my own workflow, not a formal evaluation.