|
|
|
|
|
by DrProtic
48 days ago
|
|
That’s the thing, not everyone wants and values the model based on that. But I guess it works for you, and that benchmark achieves it. I personally develop with very detailed spec, and I don’t want nothing more and nothing less compared to the spec. I found 5.4/5.5 much better at following spec while Opus makes some things up, which aligns with your benchmark but that makes 5.4/5.5 better for me while worse for you. |
|
What strike me as very strange though is that 0 model were able to just use the search input already present in GravitYForms forms list page and all created a second input.
Also, I know it's not in the prompt, but adding a ctrl+f shortcut to a search input? Is that that crazy? I don't know.