A lot of people here haven't integrated GPT into a customer-facing production system, and it shows. gpt-4, gpt-4-turbo, and gpt-4o are not the same model. They are mostly close enough when you have a human in the loop and loose constraints. But if you are building systems on top of the (already fragile) prompt-based output, you will have to go through a very manual process of tuning your prompts to get the same or similar output out of the new model. It will break in weird ways that make you feel like you are trying to nail Jello to a tree. There are software tools and services that help with this, and a ton more that merely promise to, but most of the tooling around LLMs these days gives the illusion of a reliable tool rather than the results of one. It's still the early days of the gold rush, and everyone wants to be seen as one of the first.
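To make the "manual process" a bit more concrete, here is a rough sketch of the kind of regression check you end up writing before swapping models. Everything in it is an assumption for illustration: the two "models" are stand-in functions rather than real API calls, and the names `check_output`, `compare_models`, and the `intent`/`confidence` keys are hypothetical.

```python
import json
from typing import Callable, Dict, List

# Hypothetical: a "model" is any callable that takes a prompt and returns raw text.
# In a real system this would wrap an API call pinned to one model version.
Model = Callable[[str], str]


def check_output(raw: str, required_keys: List[str]) -> List[str]:
    """Return a list of problems with one response (an empty list means OK)."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return ["not valid JSON"]
    if not isinstance(data, dict):
        return ["top-level value is not an object"]
    return [f"missing key: {k}" for k in required_keys if k not in data]


def compare_models(
    prompts: List[str], old: Model, new: Model, required_keys: List[str]
) -> Dict[str, List[str]]:
    """Report prompts where the new model fails checks that the old model passes."""
    regressions: Dict[str, List[str]] = {}
    for prompt in prompts:
        old_problems = check_output(old(prompt), required_keys)
        new_problems = check_output(new(prompt), required_keys)
        if not old_problems and new_problems:
            regressions[prompt] = new_problems
    return regressions


# Stand-in models (assumptions, not real API behavior): the "new" one wraps
# the JSON in chatty text, which is a typical way downstream parsing breaks.
def fake_old_model(prompt: str) -> str:
    return '{"intent": "refund", "confidence": 0.9}'


def fake_new_model(prompt: str) -> str:
    return 'Sure! Here is the JSON:\n{"intent": "refund", "confidence": 0.9}'


report = compare_models(
    ["I want my money back"], fake_old_model, fake_new_model, ["intent", "confidence"]
)
print(report)
```

This only catches structural breakage. Subtle shifts in tone or content still need a human looking at the diffs, which is where most of the manual work goes.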
[2]: https://insurtechdigital.com/articles/chatgpt-the-risks-and-...
--- please disregard [1]; it was a terrible initial source I pulled off Google
[1]: https://medium.com/artivatic/use-of-chatgpt-4-in-health-insu...