|
|
|
|
|
by manquer
335 days ago
|
|
There are plenty out of band(non prompt) controls , it just requires more effort than system prompts. You can control what goes into the training data set[1],that is how you label the data, what your workload with the likes of Scale AI is. You can also adjust what kind of self supervised learning methods and biases are there and how they impact the model. On a pre trained model there are plenty of fine tuning options where transfer learning approaches can be applied, distilling for LoRA all do some versions of these. Even if not as large as xAI with hundreds of thousands of GPUs available to train/fine tune we can still do some inference time strategies like tuned embeddings or use guardrails and so on . [1] Perhaps you could have a model only trained on child safe content alone (with synthetic data if natural data is not enough) Disney or Apple would be super interested in something like that I imagine . |
|