by amilios
1064 days ago
Yeah, unfortunately I think this is the result of the stochasticity of sampling from LLMs with non-zero temperature: it'll give a different answer every time, and some answers might trigger the guardrails while others might not. I'm curious whether the greedy-sampling answer contains the guardrails or not...
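
To make the point concrete, here's a rough toy sketch of the difference (not how any particular LLM API does it). The `sample_token` helper and the `logits` values are made up for illustration; the only claim is the general one that temperature 0 (greedy) always picks the highest-scoring token, while non-zero temperature draws from the softmax distribution and so can vary between runs.

```python
import math
import random


def sample_token(logits, temperature, rng):
    """Pick a token index from raw scores (hypothetical helper).

    temperature == 0 is treated as greedy decoding: always the argmax.
    Otherwise, sample from softmax(logits / temperature).
    """
    if temperature == 0:
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


# Made-up next-token scores for three candidate continuations.
logits = [2.0, 1.5, 0.3]
rng = random.Random(0)  # fixed seed so the demo is repeatable

# Distinct tokens chosen over 100 "runs" of the same prompt.
greedy = {sample_token(logits, 0, rng) for _ in range(100)}
sampled = {sample_token(logits, 1.0, rng) for _ in range(100)}

print("greedy picks:", greedy)
print("temperature=1.0 picks:", sampled)
```

If one of those lower-probability continuations is the one that leads into a refusal, you'd only see it on some runs, which matches the behavior described above.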