|
|
|
|
|
by luke-stanley
723 days ago
|
|
Given the goal of mitigating self-proliferation risks, have you observed a decrease in the model's ability to do things like help a user setup a local LLM with local or cloud software? How much is pre-training dataset changes, how much is tuning? How do you think about this problem, how do you solve it? Seems tricky to me. |
|
Literature has identified self-proliferation as dangerous capability of models, and details about how to define it and example of form it can take have been openly discussed by GDM (https://arxiv.org/pdf/2403.13793).
Current Gemma 2 models' success rate to end-to-end challenges is null (0 out 10), so the capabilities to perform such tasks are currently limited.