by embedding-shape
119 days ago
> Even if you instruct the model "don't do X" or "do X this way", you cannot rely on the model following that instruction.

Why not? I can definitely fire off two prompts to the same model and harness, one that includes "don't do X" and one that doesn't, and get what I expect: the one without the instruction didn't try to avoid doing X, and the other did. Is that not your experience using LLMs?

It makes sense if you remember that it just predicts what the next piece of text should probably be.