Y
Hacker News
new
|
ask
|
show
|
jobs
by
takeaura25
121 days ago
Excited to see the improvements in coding benchmarks. I use Claude daily and the jump in reliability from 4.5 to 4.6 has been noticeable, especially for debugging complex multi-step workflows.