Hacker News new | ask | show | jobs
by takeaura25 121 days ago
Excited to see the improvements in coding benchmarks. I use Claude daily and the jump in reliability from 4.5 to 4.6 has been noticeable, especially for debugging complex multi-step workflows.