Hacker News new | ask | show | jobs
by amilios 1064 days ago
Yeah unfortunately I think this is the result of the stochasticity of sampling from the LLMs with non-zero temperature, it'll give a different answer every time and some answers might trigger the guardrails and others might not. I am curious if the greedy-sampling answer contains the guardrails or not...