|
|
|
|
|
by tra3
1229 days ago
|
|
This is fascinating. Can I train it on 5 years of stream of consciousness morning brain dumps and then say "write blah as me"? Before I do that, I'd love to know if training data becomes part of the global knowledge base available to everyone.. |
|
This is popular because it's much much easier to do effectively than fine tuning and the OpenAI model is very capable of integrating kb snippets into a response. What I have heard is that it's easy to overdo fine tuning with OpenAI's model and makes more sense when you want a different format of response rather than just pulling in some content.
Having said all of that, they do have a fine-tuning endpoint and I am guessing if you find the right parameters and give it a lot of properly formatted training data then it will be able to do an okay job. I have the impression it is not easy to do either of those things quite right though.
As far as privacy, no they will not share your data when you use the API. ChatGPT is different, they ARE using the inputs to train the model.