Hacker News new | ask | show | jobs
by PaulHoule 335 days ago
I believe, one way or another, that the poster of that article is correctly evaluating the performance of CC now. Whether it was better before or whether he was looking at it with rose tinted glasses now is beside the point.
1 comments

What's really missing in this ecosystem is someone who runs the same problems against the same codebase repeatedly, and publishes weekly results on all LLM-based coding tools.

It seems like Playwright now allows you to control an Electron app, like a VSCode fork. The CLI tools like Claude Code are easy to include in automated testing.