|
|
|
|
|
by est31
85 days ago
|
|
You have a lot of control over LLM quality. There are different models available, and even a single model gives different outcomes at different effort settings. E.g. look at the "SWE-Bench Pro (public)" heading on this page: https://openai.com/index/introducing-gpt-5-4/ , which shows reasoning efforts from none to high. Of course, they don't learn like humans, so you can't do the trick of hiring someone less senior but with great potential and then mentoring them. Instead it's more of an up-front price you have to pay. The top models at the highest settings obviously form a ceiling, though. |
|