Hacker News new | ask | show | jobs
by shagie 1174 days ago
It is possible to tune the model, and I suspect that remains something that is done on a regular basis.

https://platform.openai.com/docs/guides/fine-tuning

In particular from https://platform.openai.com/docs/models/gpt-3

> With the release of gpt-3.5-turbo, some of our models are now being continually updated. In order to mitigate the chance of model changes affecting our users in an unexpected way, we also offer model versions that will stay static for 3 month periods. With the new cadence of model updates, we are also giving people the ability to contribute evals to help us improve the model for different use cases. If you are interested, check out the OpenAI Evals repository.

The feedback wouldn't be immediately injected back into the model (human curation of the responses is needed to see if the feedback is appropriate).

Some of the feedback would be used to train the moderation / supervisor model. https://platform.openai.com/docs/models/moderation