by Wowfunhappy
137 days ago
> For my tastes telling me "no" instead of hallucinating an answer is a real breakthrough. It's all anecdata--I'm convinced anecdata is the least bad way to evaluate these models, benchmarks don't work--but this is the behavior I've come to expect from earlier Claude models as well, especially after several back and forth passes where you rejected the initial answers. I don't think it's new.

I wish I remembered the exact versions involved. I mostly just recall how pissed I was that it was fighting me on changing a single line in my go.mod.